SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b- 
2,p2n,rnL 

Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09-7: 

329b-2.p2n.rni. 

start 

Go Back to previoi 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein 
Run on: 



nucleic search, using f rame_plus_p2n model 



July 25, 2006, 09:19:25 



Search time 269 Seconds 
(without alignments) 
4465.620 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-09-734-329B-2 
2384 

1 MAS S LLEEEAHYGSS PLAML PAPPEKAHGGSPEQSNLLEI 428 



BLOSUM62 
Xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 1403666 seqs, 935554401 residues 

Total number of hits satisfying chosen parameters: 



2807332 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US09734 32 9/runat_2507200 6_091908_3760/app_query. fasta_l 
-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=p2n . rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS =human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
- MOD E= LOCAL -OUTFMT-pto -NORM=ext -HEAPS I ZE= 5 00 -MINLEN=0 -MAXLEN=20 0000 0000 
-HOST=abss03h -USER=US09734329_@CGN_l_l_204_@runat_25072006_091908_37 60 
-NCPU=6 -ICPU=3 -NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database 



Issued_Patents_NA: * 

1 : /EMC_Celerra_SIDS3/ptodata/2/ina/l_COMB. seq : * 
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2 : /EMC_Celerra_SIDS3/ptodata/2/ina/5_COMB.seq:* 

3 : /EMC_Celerra_SIDS3/ptodata/2/ina/6A_COMB.seq:* 

4 : /EMC_Celerra_SIDS3/ptodata/2/ina/6B_COMB.seq:* 

5 : /EMC_Celerra_SIDS3/ptodata/2/ina/7_COMB.seq:* 

6 : /EMC_Celerra_SIDS3/ptodata/2/ina/H_COMB. seq: * 

7 : /EMC_Celerra_SIDS3/ptodata/2/ina/PCTUS_COMB.seq:* 

8 : /EMC_Celerra_SIDS3/ptodata/2/ina/PP_COMB.seq:* 

9 : /EMC_Celerra_SIDS3/ptodata/2/ina/RE_COMB.seq:* 

10 : /EMC_Celerra_SIDS3/ptodata/2/ina/backf ilesl . seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 


1 


818.5 


34 . 


3 


3471 


4 


US- 


10- 


094 


-749-1170 


Sequence 


1170, Ap 


2 


505 


21 . 


2 


3289 


3 


US- 


09- 


620 


-312D-436 


Sequence 


436, App 


3 


459.5 


19 . 


3 


2984 


3 


US- 


09- 


949 


-016-4299 


Sequence 


4299, Ap 


4 


453 


19 . 


0 


3937 


3 


us- 


10- 


164 


-595-7 


Sequence 


7, Appli 


5 


449.5 


1 O 

18 . 


9 


3722 


3 


US- 


10- 


164 


-595-9 


Sequence 


9, Appli 


6 


449.5 


18 . 


9 


3862 


3 


US- 


10- 


164 


-595-5 


Sequence 


5, Appli 


7 


449.5 


18 . 


y 


3985 


3 


US- 


10- 


164 


-595-3 


Sequence 


3, Appli 


8 


437.5 


1 o 
18 . 


A 


1430 


3 


us- 


09- 


492 


-985-1 


Sequence 


1, Appli 


9 


437.5 


1 Q 

lo . 


A 

H 


1471 


3 


US- 


09- 


492 


-985-11 


Sequence 


11, Appl 


10 


424.5 


1 / . 


o 
8 


1057 


3 


us- 


09- 


949 


-016-4554 


Sequence 


4554, Ap 


11 


418.5 


1 / . 


b 


1079 


3 


us- 


10- 


131 


-827-8867 


Sequence 


8867, Ap 


12 


402 


Id . 


y 


4863 


3 


us- 


09- 


949 


-016-3725 


Sequence 


3725, Ap 


13 


393 


lb . 


c 
D 


4859 


3 


us- 


09- 


814 


-915A-34 


Sequence 


34, Appl 


14 


393 


lb . 


c 
D 


4859 


3 


us- 


09- 


949 


-016-168 


Sequence 


168, App 


15 


393 


lb . 


c 
O 


4859 


4 


us- 


09- 


880 


-107-1647 


Sequence 


1647, Ap 


16 


366.5 


1 R 


A 
H 


589 


3 


us- 


09- 


229 


-037-29 


Sequence 


29, Appl 


17 


366.5 


1 R 


A 
H 


589 


3 


us- 


09- 


478 


-681-29 


Sequence 


2 9, Appl 


18 


366.5 


1 S 
±D . 


A 
H 


589 


3 


us- 


09- 


779 


-233-17 


Sequence 


1 / , Appl 


19 


366.5 


±o * 


A 
H 


589 


3 


us- 


09- 


706 


-243A-29 


Sequence 


2 9 , Appl 


20 


366.5 


15 . 


4 


589 


3 


us- 


09- 


942 


-087A-29 




9 Q T\ nn 1 


21 


366.5 


15. 


4 


589 


4 


us- 


09- 


897 


-844A-29 


Sequence 


29, Appl 


22 


366.5 


15. 


4 


589 


4 


us- 


10- 


412 


-105-17 


Sequence 


17, Appl 


23 


366.5 


15. 


4 


589 


5 


us- 


09- 


844 


-508-28 


Sequence 


28, Appl 


24 


366.5 


15. 


4 


589 


5 


us- 


10- 


245 


-415B-29 


Sequence 


29, Appl 


25 


360.5 


15. 


1 


1734 


3 


us- 


09- 


949 


-016-2024 


Sequence 


2024, Ap 


26 


359 


15. 


1 


298 


3 


us- 


09- 


229 


-037-14 


Sequence 


14, Appl 


27 


359 


15. 


1 


298 


3 


us- 


09- 


478 


-681-14 


Sequence 


14, Appl 


28 


359 


15. 


1 


298 


3 


us- 


09- 


779 


-233-2 


Sequence 


2, Appli 


29 


359 


15. 


1 


298 


3 


us- 


09- 


706 


-243A-14 


Sequence 


14, Appl 


30 


359 


15. 


1 


298 


3 


us- 


09- 


942 


-087A-14 


Sequence 


14, Appl 


31 


359 


15. 


1 


298 


4 


us- 


09- 


897 


-844A-14 


Sequence 


14, Appl 


32 


359 


15. 


1 


298 


4 


us- 


10- 


412 


-105-2 


Sequence 


2, Appli 


33 


359 


15. 


1 


298 


5 


us- 


09- 


844 


-508-10 


Sequence 


10, Appl 


34 


359 


15. 


1 


298 


5 


us- 


10- 


245 


-415B-14 


Sequence 


14, Appl 


35 


353.5 


14. 


8 


1889 


2 


us- 


08- 


946 


-241B-1 


Sequence 


1, Appli 


36 


353.5 


14. 


8 


1889 


2 


us- 


08- 


946 


-241B-8 


Sequence 


8, Appli 


37 


353.5 


14. 


8 


1889 


3 


us- 


09- 


309 


-053-1 


Sequence 


1, Appli 


38 


353.5 


14. 


8 


1889 


3 


us- 


09- 


309 


-053-8 


Sequence 


8, Appli 


39 


353.5 


14. 


8 


2639 


3 


us- 


10- 


087 


-641-2 


Sequence 


2, Appli 


40 


353.5 


14. 


8 


2639 


4 


us- 


10- 


087 


-346-2 


Sequence 


2, Appli 


41 


351.5 


14. 


7 


1953 


3 


us- 


09- 


814 


-915A-85 


Sequence 


85, Appl 


42 


351 


14. 


7 


292 


3 


us- 


09- 


395 


-448-14 


Sequence 


14, Appl 
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43 


351 


14 


.7 


292 


3 


US-09-925-796-14 


44 


351 


14 


.7 


292 


3 


US-09-941-450-14 


45 


351 


14 


.7 


298 


3 


US-09-229-037-16 



Sequence 14 , Appl 
Sequence 14, Appl 
Sequence 16, Appl 



ALIGNMENTS 



RESULT 1 

US-10-094-749-1170 

Sequence 1170, Application US/10094749 
Patent No. 6979557 
GENERAL INFORMATION: 
APPLICANT: ISOGAI, TAKAO 
APPLICANT: SUGIYAMA, TOMOYASU 
APPLICANT: OTSUKI, TETSUJI 
APPLICANT: WAKAMATSU, AI 
APPLICANT: SATO, HIROYUKI 
APPLICANT: ISHII, SHIZUKO 
APPLICANT: YAMAMOTO, JUN-ICHI 
AP PLI CANT : I S ONO , YUUKO 
APPLICANT: HIO, YURI 
APPLICANT: OTSUKA, KAORU 
APPLICANT: NAGAI, KEIICHI 
APPLICANT: IRIE, RYOTARO 
APPLICANT: TAMECHIKA, ICHIRO 
APPLICANT: SEKI, NAOHIKO 
APPLICANT : YOSHIKAWA, TSUTOMU 
APPLICANT: OTSUKA, MOTOYUKI 
APPLICANT: NAGAHARI, KENJI 
APPLICANT: MASUHO, YASUHIKO 
TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA 
FILE REFERENCE: 084335/0160 
CURRENT APPLICATION NUMBER: US/10/094,749 
CURRENT FILING DATE: 2002-03-12 
PRIOR APPLICATION NUMBER: 60/350,435 
PRIOR FILING DATE: 2002-01-24 
PRIOR APPLICATION NUMBER: JP 2001-328381 
PRIOR FILING DATE: 2001-09-14 
NUMBER OF SEQ ID NOS : 3381 
SOFTWARE: Patentln Ver . 2.1 
SEQ ID NO 1170 
LENGTH: 3471 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-094-749-1170 



Alignment Scores: 

Pred. No.: 5.34e-39 

Score: 818.50 

Percent Similarity: 53.6% 
Best Local Similarity: 42.5% 

Query Match: 34.3% 

DB: 4 



Length: 3471 

Matches: 197 

Conservative: 51 

Mismatches: 152 

Indels: 63 

Gaps: 18 



US-09-734-329B-2 (1-428) x US-10-094-749-1170 (1-3471) 



Qy 

Db 



89 



MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
MMM:::MIIII Ml I F I I I E I I i I I I : : : I I I I I I I I I I I I I M 

ATGGCAAC T TC AC T T CTAGGGGAAGAAC CGAGGT TGGGAT C GAC TCC T CTGGCC AT GC T T 148 



Qy 



21 ThrAlaAlaCysSerLysPheGlyGlySerSerPro LeuArgAsp 35 
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SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b- 
2,p2n,rnpbm. 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09-7: 

329b-2.p2n.rnpbm. 

start 

Go Back to previous 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - nucleic search, using f rame_plus_p2n model 

Run on: July 25, 2006, 09:20:29 ; Search time 1639 Seconds 

(without alignments) 
4813.094 Million cell updates/sec 

Title: US-09-734-32 9B-2 

Perfect score: 2384 

Sequence: 1 MASSLLEEEAHYGSS PLAML PAPPEKAHGGS PEQSNLLE I 428 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 18892170 seqs, 6143817638 residues 

Total number of hits satisfying chosen parameters: 37784340 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US09734 329/runat_25072006_091917_3923/app_query. fasta_l 
-DB=Published_Applications_NA_Main -QFMT=f astap -SUFFIX=p2n. rnpbm 
-MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 
-MATRIX=blosum62 -TRANS =human4 0 . cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct 
-THR_MAX=100 -THR_MIN-0 -ALIGN=15 -M0DE= LOCAL -0UTFMT=pto -N0RM=ext 
-HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss02h 

-USER=US09734329_@CGN_l_l_1675_@runat_25072006_091917_3923 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120 
-WARN_TIMEOUT=30 - THREADS =1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAP0P=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : Published_Applications NA Main:* 
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1 : /EMC_Celerra__SIDS3/ptodata/2/pubpna/US07_PUBCOMB. seq:* 

2 : /EMC_Cele r r a_S IDS 3 /ptoda ta /2 /pubpna/US 0 8_PUBC0MB .seq:* 

3 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 

4 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09B_PUBCOMB. seq:* 

5 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 

6: /EMC_Celerra_SIDS3/ptodata/2/ptibpna/US10A_PUBCOMB.seq:* 

7 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10B_PUBCOMB. seq:* 

8 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10C_PUBCOMB.seq:* 

9 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10D_PUBCOMB. seq: * 
10: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB.seq: * 
11: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10F_PUBCOMB.seq: * 
12: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB.seq:* 
13: /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllA_PUBCOMB.seq:* 
14 : /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllB_PUBCOMB . seq: * 
15: /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllC_PUBCOMB.seq:* 
16: /EMC_Celerra_SIDS3/ptodata/2/pubpna/USllD_PUBCOMB.seq:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2384 


100 


. 0 


2960 


3 


US-09-734-329-1 


Sequence 1, Appli 


2 


818.5 


34 


. 3 


3471 


7 


US-10-094-749-1170 


Sequence 1170, Ap 


3 


686 


28 


. 8 


2076 


13 


US-11-097-143-848 


Sequence 848, App 


4 


676 


28 


.4 


1675 


10 


US-10-450-763-15125 


Sequence 15125, A 


5 


671.5 


28 


.2 


1206 


7 


US-10-029-386-20198 


Sequence 20198, A 


6 


650.5 


27 


.3 


4311 


13 


US-11-097-143-847 


Sequence 847, App 


7 


553.5 


23 


.2 


1830 


7 


US-10-312-680-1 


Sequence 1, Appli 


8 


551 . 5 


23 


.1 


1197 


7 


US-10-312-680-3 


Sequence 3, Appli 


9 


505 


21 


.2 


2862 


8 


US-10-479-435-50 


Sequence 50, Appl 


10 


505 


21 


.2 


3024 


10 


US-10-888-875-1 


Sequence 1, Appli 


11 


505 


21 


.2 


3289 


6 


US-10-037-270-436 


Sequence 436, App 


12 


505 


21 


.2 


3289 


7 


US-10-117-722-436 


Sequence 436, App 


13 


505 


21 


.2 


3289 


10 


US-10-122-851-436 


Sequence 436, App 


14 


493.5 


20 


.7 


3187 


13 


US-11-097-143-11864 


Sequence 118 64, A 


c 15 


493.5 


20 


.7 


5187 


13 


US-11-097-143-11863 


Sequence 11863, A 


16 


479.5 


20 


.1 


2475 


16 


US-11-136-527-2078 


Sequence 2078, Ap 


17 


453 


19 


.0 


3936 


10 


US-10-888-875-3 


Sequence 3, Appli 


18 


453 


19 


.0 


3937 


10 


US-10-717-665-7 


Sequence 7, Appli 


19 


449.5 


18 


.9 


3709 


6 


US-10-044-090-360 


Sequence 360, App 


20 


449.5 


18 


.9 


3722 


10 


US-10-717-665-9 


Sequence 9, Appli 


21 


449.5 


18 


.9 


3862 


10 


US-10-717-665-5 


Sequence 5, Appli 


22 


449.5 


18 


.9 


3985 


10 


US-10-717-665-3 


Sequence 3, Appli 


23 


438 


18 


.4 


4285 


6 


US-10-012-600B-57 


Sequence 57, Appl 


24 


422 


17 


.7 


600 


16 


US-11-136-527-6174 


Sequence 6174, Ap 


25 


418.5 


17 


.6 


1079 


7 


US-10-131-827-8867 


Sequence 8867, Ap 


26 


414.5 


17 


.4 


1935 


13 


US-11-097-143-38099 


Sequence 38099, A 


c 27 


414 


17 


.4 


3388 


13 


US-11-097-143-13342 


Sequence 13342, A 


28 


410.5 


17 


.2 


1133 


10 


US-10-956-157-2550 


Sequence 2550, Ap 


29 


408 


17 


.1 


1615 


7 


US-10-264-049-704 


Sequence 704, App 


30 


393.5 


16 


.5 


882 


3 


US-09-764-864-642 


Sequence 642, App 


31 


393.5 


16 


.5 


900 


3 


US-09-764-864-211 


Sequence 211, App 


32 


393 


16 


.5 


4859 


3 


US-09-954-456-493 


Sequence 4 93, App 


33 


393 


16 


.5 


4859 


3 


US-09-880-107-1647 


Sequence 1647, Ap 


34 


393 


16 


.5 


4859 


6 


US-10-171-311-21 


Sequence 21, Appl 


35 


393 


16 


.5 


4859 


7 


US-10-172-118-572 


Sequence 572, App 
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36 


393 


16. 


5 


4859 


37 


393 


16. 


5 


4859 


38 


393 


16. 


5 


4859 


39 


393 


16. 


5 


4859 


40 


393 


16. 


5 


4859 


41 


393 


16. 


5 


4859 


42 


393 


16. 


5 


5450 


43 


392 


16. 


4 


2129 


44 


391.5 


16. 


4 


2721 


45 


391.5 


16. 


4 


2721 



8 US-10-342-887-572 
8 US-10-776-827-34 
10 US-10-843-641A-3520 
10 US-10-956-157-574 
10 US-10-786-148-88 
12 US-10-784-004-1192 

6 US-10-044-090-504 

7 US-10-108-260A-1978 

7 US-10-388-934-122 

8 US-10-191-803-285 



Sequence 572, App 
Sequence 34, Appl 
Sequence 3520, Ap 
Sequence 574, App 
Sequence 88, Appl 
Sequence 1192, Ap 
Sequence 504, App 
Sequence 1978, Ap 
Sequence 122, App 
Sequence 285, App 



ALIGNMENTS 

RESULT 1 
US-09-734-329-1 

; Sequence 1, Application US/09734329 
; Patent No. US20020156031A1 
; GENERAL INFORMATION: 

; APPLICANT: de CROMBRUGGHE, BENOIT 
; APPLICANT: NAKASHIMA, KAZUHISA 
; APPLICANT: ZHOU, XIN 

TITLE OF INVENTION: MASTER BONE FORMATION TRANSCRIPTION FACTOR: 
; TITLE OF INVENTION: COMPOSITIONS AND METHODS OF USE 
; FILE REFERENCE: UTXC:666 

; CURRENT APPLICATION NUMBER: US/09/734,329 

; CURRENT FILING DATE: 2000-11-30 

; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 1 

LENGTH: 2 960 

TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (100) .. (1383) 
US-09-734-329-1 



Alignment Scores: 



Pred. No.: 


2.31e-196 


Length : 


2960 


Score : 


2384.00 


Matches : 


428 


Percent Similarity: 


100.0% 


Conservative: 


0 


Best Local Similarity: 


100.0% 


Mismatches : 


0 


Query Match: 


100.0% 


Indels : 


0 


DB: 


3 


Gaps: 


0 



US-09-734-329B-2 (1-428) x US-09-734-329-1 (1-2960) 



Qy 


i 


MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 






1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 ! I I I I 1 I | I ! | | M 1 II 1 1 1 1 1 1 1 1 M 


Db 


100 


ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 159 


Qy 


21 


ThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThrThrLeuGly 40 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I 1 1 1 1 1 


Db 


160 


ACTGCAGCCTGCAGCAAAT TTGGC GGCTC TAGCCCT CTGC GGGAC TCAACAACCC T GGGG 219 


Qy 


41 


LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 60 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


Db 


220 


AAAGGAGGCAC AAAGAAGC CAT AC GC TGACC T T TCAGC C C CC AAAACC ATGGGGGAC GCC 27 9 


Qy 


61 


TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 80 
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SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b- 
2,p2n,rnpbn, 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09- 

734-329b-2.p2n.rnpbn. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - nucleic search, using f rame_plusjp2n model 

Run on: July 25, 2006, 09:24:46 ; Search time 430 Seconds 

(without alignments) 
2144.788 Million cell updates/sec 

Title : US-09-734-329B-2 
Perfect score: 2384 

Sequence: 1 MASSLLEEEAHYGSSPLAML PAPPEKAHGGSPEQSNLLEI 428 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 899801 seqs, 718270062 residues 

Total number of hits satisfying chosen parameters: 17 99602 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US09734329/runat_25072006_091919_3956/app_query.fasta_l 
-DB=Published_Applications_NA_New -QFMT=fastap -SUFFIX=p2n. rnpbn -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 
- TRANS =human 40 . cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 - MODE = LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 
-MAXLEN=2000000000 -HOST=abss03h 

-USER-US09734329_@CGN_l_l_150_@runat_25072006_091919_3956 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120 
-WARN_TIMEOUT=30 - THREADS =1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0 .5 -DELOP=6 -DELEXT=7 

Database : Published_Applications_NA_New: * 
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1: /EMC_Celerra_SIDS3/ptodata/l/pubpna/US09_NEW_PUB.seq:* 

2 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/US06_NEW_PUB.seq:* 

3 : /EMC_Cele r r a_S IDS 3 /ptoda ta / 1 /pubpna/US 0 7_NEW_PUB . s eq : * 

4 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/US08_NEW_PUB.seq: * 

5 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/PCT_NEW_PUB. seq: * 

6: /EMC_Celerra_SIDS3/ptodata/l/pubpna/US10_NEW_PUB.seq:* 

7 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/USll_NEW_PUB.seq:* 

8 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/USll_NEW_PUB. seql : * 

9 : /EMC_Celerra_SIDS3/ptodata/l/pubpna/US60_NEW_PUB.seq:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 




X 


674 


O Q 
id . 


J 


3794 


8 


US- 


-11- 


-266- 


-748A-31703 


Sequence 


31703, A 




o 


604.5 


zz> . 


A 

H 


970 


8 


US - 


-11- 


-266- 


-748A-92977 


Sequence 


92977, A 


c 


•J 


604.5 


Z J . 


A 

ft 


970 


8 


US- 


-11- 


-266- 


-748A-145788 


Sequence 


145788, 




A 


551.5 




1 


1812 


8 


US- 


■11- 


-266- 


-748A-28121 


Sequence 


28121, A 




c 
0 


505 


O 1 
c. X . 


o 


2655 


8 


US- 


-11- 


-266- 


-748A-28794 


Sequence 


28794, A 




£L 
D 


505 


Zi . 




7667 


7 


US- 


-11- 


-317- 


-330A-18 


Sequence 


18, Appl 


c 


n 
/ 


459.5 


1 Q 

X3 . 




1678 


8 


US- 


-11- 


-266- 


-748A-262860 


Sequence 


262860, 




a 
O 


459.5 






1678 


8 


US- 


-11- 


-266- 


-748A-323377 


Sequence 


323377, 




9 


453 


X -7 ■ 




3979 


8 


US- 


-11- 


-266- 


-748A-24836 


Sequence 


24836, A 




x u 


449.5 


1 p 
XO . 


Q 


1135 


8 


US- 


-11- 


-266- 


■748A-193673 


Sequence 


193673, 




i i 

X X 


449.5 


1 P 


Q 


1135 


8 


US- 


-11- 


-266- 


■748A-227203 


Sequence 


227203, 




1 9 
x ^ 


449.5 


1 P 


Q 


2217 


8 


US- 


-11- 


-266- 


-748A-29033 


Sequence 


29033, A 




13 


449.5 


1 R 


Q 


2529 


8 


US- 


-11- 


-266- 


-748A-27455 


Sequence 


27455, A 




14 


449.5 


18 . 


9 


3920 


8 


US- 


■11- 


-266- 


748A-29348 


Sequence 


O Q O A Q 7\ 
ZjOHOf A 




15 


447.5 


18 . 


3 


1522 


8 


US- 


■11- 


-266- 


748A-353547 


Sequence 


O C O C A 1 

OD Jb H 1 f 




1 6 


447.5 


IO • 


o 
o 


1522 


8 


US- 


-11- 


-266- 


748A-384508 


Sequence 


384508 , 


c 


17 


447.5 


18 . 


8 


1522 


8 


US- 


•11- 


-266- 


748A-436926 


o equence 






18 


445 


18 . 


7 


1000 


8 


US- 


■11- 


-266- 


748A-284152 


ocquence 


ZO410Z/ 




19 


445 


18 . 


7 


1000 


8 


US- 


-11- 


-266- 


748A-335581 


Sequence 


OOCC01 

Jooob JL , 




20 


438 


18. 


4 


2888 


8 


us- 


11- 


-145- 


307A-253 


fc-* w v»> ± iv^ w 


253 Ann 




21 


438 


18. 


4 


3031 


8 


us- 


11- 


-266- 


748A-56359 


Sequence 


56359, A 




22 


438 


18. 


4 


3288 


8 


us- 


11- 


-266- 


748A-28709 


Sequence 


28709, A 


c 


23 


401 . 5 


16. 


8 


2961 


8 


us- 


11- 


-266- 


748A-80279 


Sequence 


80279, A 




24 


401.5 


16. 


8 


2961 


8 


us- 


11- 


-266- 


748A-133090 


Sequence 


133090, 




25 


399.5 


16. 


8 


1000 


8 


us- 


11- 


-266- 


748A-291349 


Sequence 


291349, 


c 


26 


399.5 


16. 


8 


1000 


8 


us- 


11- 


■266- 


748A-342778 


Sequence 


342778, 




27 


393.5 


16. 


5 


842 


8 


us- 


11- 


■266- 


748A-363880 


Sequence 


363880, 




28 


393.5 


16. 


5 


842 


8 


us- 


11- 


■266- 


748A-388190 


Sequence 


388190, 


c 


29 


393.5 


16. 


5 


842 


8 


us- 


11- 


■266- 


748A-447259 


Sequence 


447259, 




30 


393 


16. 


5 


4859 


8 


us- 


11- 


-266- 


748A-28713 


Sequence 


28713, A 




31 


392 


16. 


4 


2129 


8 


us- 


11- 


293- 


697-1978 


Sequence 


1978, Ap 




32 


371 


15. 


6 


1000 


8 


us- 


11- 


-266- 


748A-403925 


Sequence 


403925, 


c 


33 


371 


15. 


6 


1000 


8 


us- 


11- 


266- 


748A-474971 


Sequence 


474971, 




34 


370.5 


15. 


5 


1618 


8 


us- 


11- 


266- 


748A-29737 


Sequence 


29737, A 




35 


369.5 


15. 


5 


499 


8 


us- 


11- 


266- 


748A-141 


Sequence 


141, App 




36 


364 .5 


15. 


3 


1152 


6 


us- 


10- 


511- 


937-402 


Sequence 


402, App 




37 


358 


15. 


0 


1668 


8 


us- 


11- 


266- 


748A-358069 


Sequence 


358069, 


c 


38 


358 


15. 


0 


1668 


8 


us- 


11- 


266- 


748A-441448 


Sequence 


441448, 




39 


354.5 


14. 


9 


1869 


8 


us- 


11- 


293- 


697-1940 


Sequence 


1940, Ap 




40 


353.5 


14. 


8 


2639 


8 


us- 


11- 


235- 


814-2 


Sequence 


2, Appli 




41 


351.5 


14. 


7 


1953 


8 


us- 


11- 


266- 


748A-32031 


Sequence 


32031, A 




42 


346.5 


14. 


5 


1953 


8 


us- 


11- 


266- 


748A-186388 


Sequence 


186388, 
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43 345 14.5 2908 8 US-11-235-814-1 Sequence 1, Appli 

44 342.5 14.4 1000 8 US-11-266-748A-117354 Sequence 117354, 

45 342.5 14.4 1000 8 US-11-266-748A-159518 Sequence 159518, 



ALIGNMENTS 

RESULT 1 

US-11-266-748A-31703 

Sequence 31703, Application US/11266748A 
Publication No. US20060134663A1 
GENERAL INFORMATION: 
APPLICANT: Harkin, Paul 
APPLICANT: Johnston, Patrick 
APPLICANT: Mulligan, Karl 

TITLE OF INVENTION: Transcriptome Microarray Technology and 
TITLE OF INVENTION: Methods of Using the Same 
FILE REFERENCE: 55815-0102 (319189) 
CURRENT APPLICATION NUMBER: US/11/2 66, 748A 
CURRENT FILING DATE: 2005-11-03 
PRIOR APPLICATION NUMBER: EP 0410547 9.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105482.6 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105483.4 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105507.0 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105485.9 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105484.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: US 60/662,276 
PRIOR FILING DATE: 2005-03-14 
PRIOR APPLICATION NUMBER: US 60/700,293 
PRIOR FILING DATE: 2005-07-18 
NUMBER OF SEQ ID NOS : 483996 
SOFTWARE: Patentln version 3.3 
SEQ ID NO 31703 
LENGTH: 3794 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-31703 

Alignment Scores : 

Pred. No.: 3.09e-20 Length: 3794 

Score: 674.00 Matches: 176 

Percent Similarity: 4 9.3% Conservative: 43 

Best Local Similarity: 39.6% Mismatches: 131 

Query Match: 28.3% Indels : 96 

DB: 8 Gaps: 20 

US-09-734-329B-2 (1-428) x US-11-266-748A-31703 (1-3794) 

Qy 18 AlaMetLeuThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThr 37 

I I I I I I I I I I I I I I I III 
Db 276 GCAATGCTAACCGCTGTCTGCGGC 299 

Qy 38 ThrLeuGlyLysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMet 57 

: : : I I I I I I III::: I I I : : : I II : : : I I I I I I : : : 
Db 300 TC T C TGGGC AGCCAGCAC AC GGAAGC GC C GC ACGC C TCCC C - GC CGC GCC T C GA 352 
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SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b 
2,p2n,rng, 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09- 
734-329b-2.p2n.rng. 

start 

Go Back to previous paoe 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - nucleic search, using f rame_plus j>2n model 

Run on: July 25, 2006, 09:18:39 ; Search time 845 Seconds 

(without alignments) 
5297.259 Million cell updates/sec 

Title: US-09-734-329B-2 
Perfect score: 2384 

Sequence: 1 MASSLLEEEAHYGSS PLAML PAPPEKAHGGSPEQSNLLEI 428 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 5244920 seqs, 3486124231 residues 

Total number of hits satisfying chosen parameters: 10489840 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US09734329/runat_2507200 6_091900_3652/app_query. fasta_l 
-DB=N_Geneseq -QFMT=fastap -SUFFIX=p2n. rng -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 - TRANS =human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE= LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss05h 
-USER=US09734329_@CGN_l_l_761_@runat_25072006_091900_3652 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG - DEV_T IMEOUT=l 2 0 
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 -FGAPEXT =7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : N_Geneseq_8 : * 

1: geneseqnl980s : * 
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2: geneseqnl990s : * 

3: geneseqn2000s : * 

4 : geneseqn2001as: * 

5 : geneseqn2001bs: * 

6 : geneseqn2002as : * 

7 : geneseqn2002bs : * 

8 : geneseqn2003as:* 

9 : geneseqn2003bs: * 

10 : geneseqn2003cs : * 

11 : geneseqn2003ds : * 

12: geneseqn2004as : * 

13: geneseqn2004bs : * 

14: geneseqn2005s : * 

15: geneseqn2006s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2384 


i no 

1UU 


n 
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6 
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Aad40709 Mouse ost 


2 
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Q ^ 


■ u 
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ADR08250 


Adr082 50 Full leng 


3 
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o D 


p 
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1737 


12 


ADP28826 


Adp28826 Human sec 


c 4 


836.5 


0 O 


1 


3128 


13 


ADR07977 


Adr07977 Full leng 


5 


818.5 




0 
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3471 


10 


ADAS 3 602 


Ada53602 Human cod 


6 


818.5 




0 


3471 


14 


AEB53898 


Aeb538 98 DNA encod 


7 


774.5 






1473 


12 


ADO00467 


Ado004 67 Novel hum 


8 


774.5 


17 


. «j 


1473 


12 


ADN98898 


Adn988 98 Novel hum 


9 
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28 
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4 


ABL02405 


Abl02405 Drosophil 


10 
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28 


. 4 
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5 


AAS79321 


Aas79321 DNA encod 


11 
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28 


.3 
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13 


ADR07645 


Adr07645 Full leng 


12 


671.5 


28 


.2 


1206 


12 


ACH87003 


Ach87003 Human gen 


13 


669 


28 


.1 
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13 


ADR07511 


Adr07511 Full leng 


14 
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27 


.3 
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4 


ABL02404 
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15 
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23 


.2 
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6 
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16 
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23 


.2 
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14 


ADX83136 


Adx83136 Human TEG 


17 
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23 


.1 
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6 


ABA96949 


Aba96949 Human Spl 


18 


551.5 


23 


.1 


1554 


10 


ADC30435 


Adc30435 Human nov 


19 


505 


21 


.2 


2862 


10 


AAD51694 


Aad51694 Human nuc 


20 


505 


21 


.2 


3024 


14 


ADZ72114 


Adz72114 Human tra 


21 


505 


21 


.2 


3289 


4 


AAI58549 


Aai5854 9 Human pol 


22 


505 


21 


.2 


3289 


5 


ADQ98766 


Adq98766 DNA encod 


23 


505 


21 


.2 


3289 


9 


ADB48526 


Adb48526 Novel hum 


24 


501.5 


21 


.0 


3090 


4 


AAI60335 


Aai 60335 Human pol 


25 
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20 


.7 
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4 


ABL09749 


Abl0974 9 Drosophil 
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.7 
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4 


ABL09748 


Abl09748 Drosophil 


27 
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Adz72116 Human tra 


28 


453 
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10 
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18 
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Adt07445 Human col 


30 


449.5 
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.9 
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13 


ADT07444 


Adt07444 Human col 


31 


449.5 


18 


.9 
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13 


ADT07441 


Adt07441 Human col 


32 


449.5 


18 


.9 


3430 


13 


ADT07439 


Adt07439 Human col 


33 


449.5 


18 


.9 


3448 


13 


ADT07440 


Adt07440 Human col 


34 


449.5 
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.9 


3548 
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Abk84634 Human cDN 


35 


449.5 
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3548 


14 


ADX07998 


Adx07 998 Cyclin-de 


36 


449.5 


18 
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Aef74819 Human pol 
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.9 


3707 


13 
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Adt07438 Human col 
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38 
39 
40 
41 
42 
43 
44 
45 



449.5 
449.5 
449.5 
449.5 
449.5 
449.5 



438 
438 



18.9 
18.9 
18.9 
18.9 
18.9 
18.9 
18.4 
18.4 



3709 
3722 
3763 
3862 
3920 
3985 
2888 
3288 



8 ABX63360 
10 ADK65770 

14 AEA19695 
10 ADK65766 
13 ADR83463 
10 ADK65764 

15 AEE80965 
6 ABK83634 



Abx63360 Human cDN 
Aclk657 70 Angiogene 
Aeal9695 Novel hum 
Adk657 66 Angiogene 
Adr834 63 Human Sp3 
Adk657 64 Angiogene 
Aee80965 Human cDN 

Abk83634 Human cDN 



ALIGNMENTS 



RESULT 1 
AAD40709 

ID AAD40709 standard; cDNA; 2960 BP. 
XX 

AC AAD40709; 
XX 

DT 30-OCT-2002 (first entry) 
XX 

DE Mouse osterix cDNA. 
XX 

KW Bone formation; transcription factor; osteoblast; Paget ! s disease; 

KW osterix protein; glucocorticoid; osteoporosis; cytostatic; cytokine; 

KW periodontal disease; tooth loss; bone fracture; rheumatoid arthritis; 

KW metastatic bone disease; gene therapy; growth factor; osteopathic; 

KW differentiation; mouse; chromosome 15; gene; ss. 
XX 

OS Mus musculus . 
XX 

FH Key Location/Qualifiers 

FT CDS 100. .1386 

FT /*tag= a 

FT /product= "Mouse osterix protein" 
XX 

PN WO200244380-A2. 
XX 

PD 06-JUN-2002. 
XX 

PF 30-NOV-2001; 2001WO-US044898 . 
XX 

PR 30-NOV-2000; 2000US-00734329 . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI De Crombrugghe B, Nakashima K, Zhou X; 
XX 

DR WPI; 2002-519587/55. 

DR P-PSDB; AAE2507 6. 
XX 

PT Novel DNA segment encoding Osterix polypeptide which is a master 

PT transcription factor that controls osteoblast differentiation and is 

PT useful for treating osteoporosis, in patient by stimulating bone 

PT formation. 

XX 

PS Claim 17; Page 138-141; 144pp; English. 
XX 

CC The invention relates to a master bone formation transcription factor 

CC that controls osteoblast differentiation, osterix protein and its 

CC corresponding nucleic acid sequence. Osterix protein and its DNA and 

CC agents that interact with the protein to activate or stimulate the 



http://es/ScoreAccessWeb/Gedtem.action?AppId=09734329&seqId=568837&ItemNam 8/1/06 



CC differentiation of bone cells is used for treatment of glucocorticoid 

CC induced osteoporosis, Paget ! s disease, periodontal disease, tooth loss, 

CC bone fractures, rheumatoid arthritis, metastatic bone disease, etc. 

CC Osterix DNA is useful as probes or primers in nucleic acid hybridisation 

CC experiments. It is also used in gene therapy. Osterix protein is useful 

CC for controlling bone formation, serves as receptors of soluble molecules, 

CC e.g. cytokines, growth factors, etc, as homing/adhesion/ rolling 

CC receptors mediating the migration of osteoblast, as signalling receptors, 

CC thereby regulating function of osteoblasts, and/or ligands of signalling 

CC receptors or osteoblast. It is also useful for purifying osteoblasts, 

CC from a test composition suspected of containing the cells. The present 

CC sequence is mouse osterix cDNA. Mouse osterix gene is located at 

CC chromosome 15 

XX 

SQ Sequence 2960 BP; 647 A; 873 C; 719 G; 721 T; 0 U; 0 Other; 



Alignment Scores: 








Pred. No. : 


1.51e-87 


Length: 


2960 


Score : 


2384.00 


Matches : 


428 


Percent Similarity: 


100.0% 


Conservative : 


0 


Best Local Similarity: 


100.0% 


Mismatches : 


0 


Query Match: 


100.0% 


Indels : 


0 


DB: 


6 


Gaps : 


0 



US-09-734-329B-2 (1-428) x AAD40709 (1-2960) 

MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
I I I I I I M I I I I I I I I I I t I I I I II I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I M I I 
ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 159 

ThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThrThrLeuGly 40 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AC T GCAGCC TGC AGC AAAT T TGGC GGC TC TAGCCC TC T GC GGGAC TCAACAAC C C T GGGG 219 

LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAAGGAGGCACAAAGAAGC CATAC GCTGACCT TTCAGCCCCCAAAACC ATGGGGGACGCC 279 

TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 80 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
TACCCAGCTCCCTTCTCAAGCACCAATGGACTCCTCTCTCCTGCAGGCAGTCCTCCGGCC 339 

ProAlaSerGlyTyrAlaAsnAspTyrProProPheProHisSerPheProGlyProThr 100 
M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCAGCCTCTGGCTATGCAAATGACTACCCACCCTTCCCTCACTCATTTCCTGGGCCCACC 399 

GlyAlaGlnAspProGlyLeuLeuValProLysGlyHisSerSerSerAspCysLeuPro 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I II I I I I I I I I I I 
GGTGCCCAAGACCCTGGGCTCCTAGTGCCTAAGGGGCACAGCTCGTCTGACTGCCTGCCT 459 

SerValTyrThrSerLeuAspMetThrHisProTyrGlySerTrpTyrLysAlaGlylle 140 

M I II M I I II If I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGT GTC T AC AC T TC C C TGGATATGAC TC ATC C C TAT GGC T CGTGGTAC AAGGCAGGC ATC 519 

HisAlaGlylleSerProGlyProGlyAsnThrProThrProTrpTrpAspMetHisPro 160 

I I f I I 1 I I I t I I 1 I I I f I I 1 I I I I I I I t I I I I I I I f I I f I I I I ! I I I I I I I I 1 II 

CAC GCAGGCATCTCACCAGGTCCAGGCAACAC ACCT AC TC C T TGGTGGGACATGCACCCT 57 9 

GlyGlyAsnTrpLeuGlyGlyGlyGlnGlyGlnGlyAspGlyLeuGlnGlyThrLeuSer 180 

I ■ I I I I I I I I I I I I I 1 I I I I I I i I I t I I I I I MM lllllll MINIMI 

GGGGGCAACTGGCTAGGTGGTGGTCAGGGCCAGGGTGATGGGCTGCAAGGGACACTGTCC 639 



Qy 


1 


Db 


100 


Qy 


21 


Db 


160 


Qy 


41 


Db 


220 


Qy 


61 


Db 


280 


Qy 


81 


Db 


340 


Qy 


101 


Db 


400 


Qy 


121 


Db 


460 


Qy 


141 


Db 


520 


Qy 


161 


Db 


580 
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Qy 181 ThrGlyProAlaGlnProProLeuAsnProGlnLeuProThrTyrProSerAspPheAla 200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I | | | | | | | | 
Db 640 ACAGGCC C TGC C C AGC CTC C AC TGAAC CC CC AGC TGC C T AC TT AC CC ATCTGAC T T TGC T 699 

Qy 201 ProLeuAsnProAlaProTyrProAlaProHisLeuLeuGlnProGlyProGlnHisVal 220 

I t I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I | | I | | | | | | | | | | | | | | | | | | | | | | 
Db 700 CCCCTTAACCCAGCTCCCTACCCAGCGCCCCACCTCTTGCAACCAGGGCCCCAGCATGTC 759 

Qy 221 LeuProGlnAspValTyrLysProLysAlaValGlyAsnSerGlyGlnLeuGluGlySer 240 

I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 760 C TACCCCAAGATGTC TATAAGCCCAAGGCGGT TGGCAATAGTGGGCAAC TGGAGGGGAGT 819 

Qy 241 GlyAlaAlaLysProProArgGlyAlaGlyThrGlyGlySerGlyGlyTyrAlaGlySer 260 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | M | 
Db 820 GGTGCAGCCAAACCCCCTCGGGGTGCTGGCACAGGGGGCAGCGGTGGATATGCGGGCAGT 879 

Qy 261 GlyAlaGlyArgSerThrCysAspCysProAsnCysGlnGluLeuGluArgLeuGlyAla 280 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I 
Db 880 GGGGC AGGGCGT TCT AC C TGC GAC TGCC CCAAC TGTC AGGAGC T AGAGC GGC TC GGGGC A 939 

Qy 281 AlaAlaAlaGlyLeuArgLysLysProIleHisSerCysHisIleProGlyCysGlyLys 300 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 940 GCAGC GGC TGGGC TGAGGAAGAAGCCC AT TCACAGC TGCC AC AT CC CTGGGT GC GGCAAG 999 

Qy 301 ValTyrGlyLysAlaSerHisLeuLysAlaHisLeuArgTrpHisThrGlyGluArgPro 320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1000 GTGTACGGCAAGGCTTCGCATCTGAAAGCCCACTTGCGCTGGCACACTGGCGAGAGGCCT 1059 

Qy 321 PheValCysAsnTrpLeuPheCysGlyLysArgPheThrArgSerAspGluLeuGluArg 340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1060 T TC GTC T GCAAC TGGC T T T TC TGC GGCAAGAGGT TC AC TC GC T C TGAC GAGC TGGAGC GC 1119 

Qy 341 HisValArgThrHisThrArgGluLysLysPheThrCysLeuLeuCysSerLysArgPhe 360 

I I I M I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | 
Db 1120 CACGTGCGCACTCACACCCGGGAGAAGAAGTTCACTTGCCTGCTCTGTTCCAAGCGCTTT 117 9 

Qy 361 ThrArgSerAspHisLeuSerLysHisGlnArgThrHisGlyGluProGlyProGlyPro 380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1180 ACC AGAAGCGACC AC T TGAGC AAAC ATC AGC GC AC C C ACGGGGAGCC AGGCC CGGGACCG 1239 

Qy 381 ProProSerGlyProLysGluLeuGlyGluGlyArgSerValGlyGluGluGluAlaAsn 400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1240 C C C C C AAGTGGC C C TAAGGAGCTGGGGGAGGGTC GCAGC GTC GGGGAAGAAGAAGC CAAT 1299 

Qy 401 GlnProProArgSerSerThrSerProAlaProProGluLysAlaHisGlyGlySerPro 420 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1300 CAGCCGCCCCGATCTTCCACTTCGCCTGCACCCCCAGAAAAAGCCCACGGAGGCAGCCCA 1359 

Qy 421 GluGlnSerAsnLeuLeuGluIle 428 

I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1360 GAGCAGAGCAACCTGCTAGAGATC 138 3 

RESULT 2 
ADR08250 

ID ADR08250 standard; cDNA; 3153 BP. 
XX 

AC ADR08250; 
XX 

DT 04-NOV-2004 (first entry) 
XX 
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SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b 
2,p2n.rge. 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09- 
734-329b-2.p2n.rge. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - nucleic search, using f rame_plus_p2n model 

Run on: July 25, 2006, 09:19:19 ; Search time 10184 Seconds 

(without alignments) 
4031.249 Million cell updates/sec 

Title : US-09-734-329B-2 
Perfect score: 2384 

Sequence: 1 MASSLLEEEAHYGSS PLAML PAPPEKAHGGSPEQSNLLEI 428 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 12732272 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlp 

-Q=/abss/ABSSWEB_spool/US09734 329/runat_25072006_091902_3663/app_query. fasta_l 
-DB=GenEmbl -QE T MT=f astap -SUFFIX=p2n. rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 - TRANS =human40 . cdi -LIST-45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE = LOCAL 
-OUTFMT=pto -NORM=ext -HEAPS IZE=500 -MINLEN=0 -MAXLEN=2 00000 0000 -HOST=abss06p 
-USER=US09734329_(5CGN_l_l_8328_@runat_25072006_091902_3663 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120 
-WARN_TIMEOUT=30 - THREADS =1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl:* 

1: gb_env:* 
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2 




y ky Cl L> • 


3 




gb ph ! * 


4 






5 






6 






7 




of c • * 

y yj o L. O • 


8 






9 






10 


y iw> vi . 


11 


gb ov: * 


12 


gb_htg : 


13 


gb_in : * 


14 


gb_om : * 


15 


gb_ba : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


2384 


100 


.0 


2284 


6 


BC113150 


BC113150 Mus muscu 




2 


2384 


100 


.0 


2960 


2 


AX555029 


AX555029 Sequence 




3 


2384 


100 


.0 


2960 


6 


AF184902 


AF184902 Mus muscu 


c 


4 


2360 


99 


.0 


138860 


12 


AC055703 


AC055703 Mus muscu 


c 


5 


2360 


99 


.0 


205599 


6 


AC137156 


AC137156 Mus muscu 


c 


6 


2360 


99 


.0 


208289 


6 


AC163291 


AC163291 Mus muscu 




7 


2356 


98 


.8 


1403 


6 


AY803733 


AY803733 Mus muscu 




8 


2338 


98 


.1 


1287 


6 


AY177399 


AY177399 Rattus no 


c 


9 


2333 


97 


.9 


254686 


12 


AC097309 


AC097309 Rattus no 




10 


2264 . 5 


95 


.0 


1357 


5 


BC101549 


BC10154 9 Homo sapi 




X -L 


OOC.A ^ 
Z Z Oft . D 


95 


.0 


£. O 4 O 


c 
O 


AI IjUO / 3 


AiiDUD / j xiomo sapi 




12 


2264.5 


95 


.0 


2971 


5 


AF466179 


AF4 6617 9 Homo sapi 




13 


2264.5 


95 


.0 


2995 


5 


AF477981 


AF4 77981 Homo sapi 




14 


2264.5 


95 


.0 


3153 


2 


CQ851287 


CQ851287 Sequence 




15 


2264.5 


95 


.0 


3153 


5 


AK128520 


AK128520 Homo sapi 




16 


2240.5 


94 


.0 


1575 


2 


CQ740107 


CQ740107 Sequence 


c 


17 


2239.5 


93 


.9 


106551 


5 


AC073611 


AC073611 Homo sapi 




18 


2239.5 


93 


.9 


166697 


12 


AC021103 


AC021103 Homo sapi 




19 


2236.5 


93 


.8 


1350 


5 


AY150674 


AY150674 Homo sapi 




20 


2212.5 


92 


.8 


173487 


12 


AC166700 


AC166700 Bos tauru 




21 


1301 


54 


.6 


1433 


11 


AY380818 


AY380818 Danio rer 




22 


866.5 


36 


.3 


2381 


11 


AY769697 


AY769697 Danio rer 




23 


857.5 


36 


.0 


1940 


11 


BC067673 


BC067673 Danio rer 




24 


857.5 


36 


.0 


1977 


11 


AY731230 


AY731230 Danio rer 




25 


854.5 


35 


.8 


2380 


6 


AY591908 


AY591908 Mus muscu 




26 


843 


35 


.4 


1765 


11 


AY769696 


AY769696 Danio rer 




27 


842.5 


35 


.3 


2459 


11 


AY591905 


AY591905 Danio rer 




28 


842.5 


35 


.3 


240532 


12 


AC130082 


AC130082 Rattus no 




29 


838.5 


35 


.2 


2382 


11 


BC067654 


BC067654 Danio rer 


c 


30 


836.5 


35 


.1 


3128 


2 


CQ851014 


CQ851014 Sequence 


c 


31 


836.5 


35 


.1 


3128 


5 


AK128215 


AK128215 Homo sapi 




32 


836.5 


35 


.1 


171734 


6 


AL954713 


AL954713 Mouse DNA 




33 


836.5 


35 


.1 


174437 


5 


AC018470 


AC018470 Homo sapi 




34 


836.5 


35 


.1 


211696 


12 


AC099580 


AC099580 Mus muscu 




35 


833 


34 


.9 


2053 


11 


AY591904 


AY591904 Danio rer 




36 


829 


34 


.8 


155001 


11 


BX005032 


BX005032 Zebrafish 




37 


828.5 


34 


.8 


1618 


5 


AY167047 


AY16704 7 Homo sapi 
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>«? 0 


823 


34 


. 5 


1747 


11 


O v»» vj «J «J .> / 


Rrfl6SS97 nan-i r» r^r 
/ Ualllu ICI 


<3 -7 


822 . 


5 




. 5 


7 91 9 


1 1 

J. X 


ZVY4 ^71 4 1 


HY4^"7141 nani r> r or 


40 


818. 


5 


34 


. 3 


3471 


2 


AX 714 48 6 




41 


818. 


5 


34 


.3 


3471 


5 


AK056857 


AK056857 Homo sapi 


42 


815. 


5 


34 


.2 


190372 


6 


AC131717 


AC131717 Mus muscu 


43 


815. 


5 


34 


.2 


191231 


6 


AC126277 


AC126277 Mus muscu 


44 


811. 


5 


34 


.0 


3581 


5 


BC038669 


BC038669 Homo sapi 


45 


810. 


5 


34 


.0 


1606 


5 


AY167048 


AY167048 Homo sapi 



ALIGNMENTS 



RESULT 1 
BC113150 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



CONSRTM 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
CONSRTM 
TITLE 
JOURNAL 

REMARK 
COMMENT 



BC113150 2284 bp mRNA linear ROD 06-FEB-2006 

Mus musculus trans-acting transcription factor 7, mRNA (cDNA clone 
MGC: 130562 IMAGE : 40054399 ) , complete cds . 
BC113150 

BC113150.1 GI: 86577831 
MGC. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Mus. 

1 (bases 1 to 2284) 

Strausberg, R. L. , Feingold, E.A. , Grouse, L.H. , Derge, J.G. , 
Klausner, R.D. , Collins, F. S . , Wagner, L., Shenmen, C .M. , Schuler, G.D. , 
Altschul, S. F. , Zeeberg,B., Buetow,K.H., Schaef er , C . F. , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh, F. , 
Diatchenko, L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M.B., Bonaldo, M. F . , Casavant, T . L. , 
Scheetz, T . E . , Brownstein, M. J. , Usdin, T .B. , Toshiyuki, S . , 
Carninci,P., Prange,C, Raha,S.S., Loquellano, N. A. , Peters, G. J., 
Abramson, R.D. , Mullahy, S . J . , Bosak,S.A., McEwan, P.J., 
McKernan, K. J. , Malek, J. A. , Gunaratne , P . H . , Richards, S., 
Worley,K.C, Hale,S., Garcia,A.M., Gay, L.J. , Hulyk,S.W., 
Villalon, D.K. , Muzny, D.M., Sodergren, E . J. , Lu, X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A., Rodrigues, S . , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C, Shevchenko, Y . , 
Bouf f ard, G.G. , Blakesley, R .W. , Touchman, J. W. , Green, E.D., 
Dickson, M. C. , Rodriguez, A. C . , Grimwood,J., Schmutz,J., Myers, R.M., 
Butter field, Y . S . , Krzywinski,M. I . , Skalska,U., Smailus, D . E . , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Mammalian Gene Collection Program Team 

Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
12477932 

2 (bases 1 to 2284) 

NIH MGC Project 
Direct Submission 

Submitted (05-FEB-2006) National Institutes of Health, Mammalian 
Gene Collection (MGC), Bethesda, MD 20892-2590, USA 
NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 
Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Baylor Human Genome Sequencing Center 

cDNA Library Preparation: Baylor Human Genome Sequencing Center 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
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FEATURES 

source 



gene 



CDS 



DNA Sequencing by: National Institutes of Health Intramural 

Sequencing Center (NISC), 

Gaithersburg, Maryland; 

Web site: http://www.nisc.nih.gov/ 

Contact : nisc_mgc@nhgri . nih . gov 

Akhter, N. , Ayele, K. , Beckstrom-Sternberg, S .M. , Benjamin, B. , 
Blakesley, R.W. , Bouf f ard, G .G. , Breen,K., Brinkley,C, Brooks, S., 
Dietrich, N.L. , Granite, S., Guan,X., Gupta, J., Haghighi,P., 
Hansen, N. , Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., 
Maduro, Q. L. , Masiello, C . , Maskeri, B. , Mastrian, S .D . ,McCloskey, J.C. , 
McDowell, J., Pearson, R., Stantripop, S . , Thomas, P.J. , Touchman, J.W. , 
Tsurgeon,C, Vogt,J.L., Walker, M. A., Wetherby, K.D. , Wiggins, L., 
Young, A., Zhang, L.-H. and Green, E.D. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov 
Series: IRAM Plate: 15 Row: f Column: 19. 

Location/Qualifiers 

1. .2284 

/organism="Mus musculus" 
/mol_type="mRNA" 
/db_xr e f = " taxon : 1 0 0 9 0 " 
/clone="MGC: 130562 IMAGE : 40054399 " 
/tissue_type="PCR rescued clones" 
/clone_lib=" NIH_MGC_2 85" 

/note="Vector : pCR-Blunt II-TOPO with reversed insert; 

Clone identification sequence tag: ATAGAGTG" 

1. .2284 

/gene="Sp7" 

/note=" synonym: C22" 

/db_xref ="GeneID : 170574 " 

/db_xref ="MGI : 2153568 " 

38. .1324 

/gene="Sp7" 

/ codon_start=l 

/product="Sp7 protein" 

/pr o t ein_id= " AAI 13151.1" 

/db_xref ="GI : 86577832 " 

/db_xref="GeneID: 170574" 

/db_xref ="MGI : 2153568 " 

/translation="MASSLLEEEAHYGSSPLAMLTAACSKFGGSSPLRDSTTLGKGGT 
KKPYADLS7VPKTMGDAYPAPFSSTNGLLSPAGSPPAPASGYANDYPPFPHSFPGPTGA 
QDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWYKAGIHAGISPGPGNTPTPWWDMHP 
GGNWLGGGQGQGDGLQGTLSTGPAQPPLNPQLPTYPSDFAPLNPAPYPAPHLLQPGPQ 
HVLPQDVYKPKAVGNSGQLEGSGAAKPPRGAGTGGSGGYAGSGAGRSTCDCPNCQELE 
RLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHTGERPFVCNWLFCGKRFTR 
SDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGEPGPGPPPSGPKELGEGR 
SVGEEEANQPPRSSTSPAPPEKAHGGSPEQSNLLEI" 



ORIGIN 



Alignment Scores: 

Pred. No. : 3.49e-131 

Score: 2384.00 

Percent Similarity: 100.0% 
Best Local Similarity: 100.0% 

Query Match: 100.0% 

DB: 6 



Length: 2284 

Matches: 428 

Conservative: 0 

Mismatches: 0 

Indels : 0 

Gaps : 0 



US-09-734-329B-2 (1-428) x BC113150 (1-2284) 

Qy 1 MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
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1 1 I I I 1 1 I I I I I I i I 1 1 I I 1 1 1 1 I I 1 1 I I 1 1 1 I I 1 1 I I I I I I 1 1 1 1 I I I I I I 1 1 1 1 1 1 1 1 

Db 38 ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 97 

Qy 21 ThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThrThrLeuGly 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 98 ACTGCAGCCTGCAGCAAATTTGGCGGCTCTAGCCCTCTGCGGGACTCAACAACCCTGGGG 157 

Qy 41 LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 60 

I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 158 AAAGGAGGC AC AAAGAAGC CAT AC GC TGAC C T T TCAGCCC C CAAAACC ATGGGGGAC GCC 217 

Qy 61 TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 218 TACCCAGCTCCCTTCTCAAGCACCAATGGACTCCTCTCTCCTGCAGGCAGTCCTCCGGCC 277 

Qy 81 ProAlaSerGlyTyrAlaAsnAspTyrProProPheProHisSerPheProGlyProThr 100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 C C AGC C T C TGGC TAT GC AAATGAC T AC C C ACC C T TC CC T C AC TC AT T T C CTGGGC C C ACC 337 

Qy 101 GlyAlaGlnAspProGlyLeuLeuValProLysGlyHisSerSerSerAspCysLeuPro 120 

I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 338 GGT GC CC AAGAC C C T GGGC TCC TAGTGC C TAAGGGGCAC AGC TC GTC T GAC T GCC T GCC T 397 

Qy 121 SerValTyrThrSerLeuAspMetThrHisProTyrGlySerTrpTyrLysAlaGlylle 140 

I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 398 AGTGTCTACACTTCCCTGGATATGACTCATCCCTATGGCTCGTGGTACAAGGCAGGCATC 457 

Qy 141 HisAlaGlylleSerProGlyProGlyAsnThrProThrProTrpTrpAspMetHisPro 160 

I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 458 C AC GC AGGCAT C TC ACC AGGT C CAGGCAAC AC ACC T AC TCC T TGGTGGGAC AT GC AC C C T 517 

Qy 161 GlyGlyAsnTrpLeuGlyGlyGlyGlnGlyGlnGlyAspGlyLeuGlnGlyThrLeuSer 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 518 GGGGGCAACTGGCTAGGTGGTGGTCAGGGCCAGGGTGATGGGCTGCAAGGGACACTGTCC 577 

Qy 181 ThrGlyProAlaGlnProProLeuAsnProGlnLeuProThrTyrProSerAspPheAla 200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 578 ACAGGCCCTGCCCAGCCTCCACTGAACCCCCAGCTGCCTACTTACCCATCTGACTTTGCT 637 

Qy 201 ProLeuAsnProAlaProTyrProAlaProHisLeuLeuGlnProGlyProGlnHisVal 220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 638 CCCCTTAACCCAGCTCCCTACCCAGCGCCCCACCTCTTGCAACCAGGGCCCCAGCATGTC 697 

Qy 221 LeuProGlnAspValTyrLysProLysAlaValGlyAsnSerGlyGlnLeuGluGlySer 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 698 C TACCCC AAGATGTC TATAAGCCC AAGGCGGT TGGCAATAGTGGGCAACTGGAGGGGAGT 757 

Qy 241 GlyAlaAlaLysProProArgGlyAlaGlyThrGlyGlySerGlyGlyTyrAlaGlySer 260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 758 GGTGCAGCCAAACCCCCTCGGGGTGCTGGCACAGGGGGCAGCGGTGGATATGCGGGCAGT 817 

Qy 2 61 GlyAlaGlyArgSerThrCysAspCysProAsnCysGlnGluLeuGluArgLeuGlyAla 280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 818 GGGGCAGGGCGTTCTACCTGCGAC TGCCCCAACTGTCAGGAGCTAGAGCGGC TCGGGGCA 877 

Qy 281 AlaAlaAlaGlyLeuArgLysLysProIleHisSerCysHisIleProGlyCysGlyLys 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 878 GC AGC GGC TGGGC T GAGGAAGAAGCCC AT T CAC AGC T GC C AC AT CC C T GGGT GCGGCAAG 937 

Qy 301 ValTyrGlyLysAlaSerHisLeuLysAlaHisLeuArgTrpHisThrGlyGluArgPro 320 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 

Qy 

Db 



938 GTGTACGGCAAGGCTTCGCATCTGAAAGCCCACTTGCGCTGGCACACTGGCGAGAGGCCT 997 

321 PheValCysAsnTrpLeuPheCysGlyLysArgPheThrArgSerAspGluLeuGlu/lrg 340 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I 
998 TTCGTCTGCAACTGGCTTTTCTGCGGCAAGAGGTTCACTCGCTCTGACGAGCTGGAGCGC 1057 



Qy 341 HisValArgThrHisThrArgGluLysLysPheThrCysLeuLeuCysSerLysArgPhe 360 

I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 1 
Db 1058 C AC GTGC GC AC TC AC ACC C GGGAGAAGAAGT T C AC C TGC C TGC TC TGT TC CAAGCGC T T T 1117 



Qy 

Db 

Qy 

Db 



361 ThrArgSerAspHisLeuSerLysHisGlnArgThrHisGlyGluProGlyProGlyPro 380 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 1 I I I I I I I I I I I I 
1118 ACC AGAAGCGACCAC TTGAGCAAACATC AGCGCACC C ACGGGGAGCCAGGCC CGGGACCG 1177 



381 



400 



ProProSerGlyProLysGluLeuGlyGluGlyArgSerValGlyGluGluGluAlaAsn 
II I I I I I I I I I II I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
1178 CC C CCAAGTGGCC C T AAGGAGC TGGGGGAGGG TC GC AGC GTCGGGGAAGAAGAAGC CAAT 1237 



Qy 401 GlnProProArgSerSerThrSerProAlaProProGluLysAlaHisGlyGlySerPro 420 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 1238 CAGCCGCCCCGATCTTCCACTTCGCCTGCACCCCCAGAAAAAGCCCACGGAGGCAGCCCA 1297 

Qy 421 GluGlnSerAsnLeuLeuGluIle 428 

I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1298 GAGC AGAGCAACC TGC T AGAGATC 1321 



RESULT 2 
AX555029 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CDS 



AX555029 2960 bp DNA linear PAT 27-NOV-2002 

Sequence 1 from Patent WO0244380. 

AX555029 

AX555029.1 GI: 25898589 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Mus. 
1 

de Crombrugghe, B. , Nakashima, K. and Zhou,X. 

Master bone formation transcription factor: compositions and 
methods of use 

Patent: WO 0244380-A 1 06-JUN-2002; 

BOARD OF REGENTS THE UNIVERSITY OF TEXAS SYSTEM (US) 
Location/Qualifiers 
1. .2960 

/organism="Mus musculus" 
/mol_type="unassigned DNA" 
/db_xref="taxon: 10090" 
100. .1386 

/note="unnamed protein product" 
/codon_start=l 
/protein_id="CAD58224 . 1" 
/db_xref="GI:25898590" 

/trans la tion= "MAS SLLEEEAHYGSSPLAMLTAACSKFGGSSPLRDSTTLGKGGT 
KKP Y ADLS APKTMGDAY PAP FS S TNGLLS PAGS P PAPASG YAND Y PPFPHS FPGPTGA 
QDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWYKAGIHAGISPGPGNTPTPWWDMHP 
GGNWLGGGQGQGDGLQGTLSTGPAQPPLNPQLPTYPSDFAPLNPAPYPAPHLLQPGPQ 
HVLPQDVYKPKAVGNSGQLEGSGAAKPPRGAGTGGSGGYAGSGAGRSTCDCPNCQELE 
RLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHTGERPFVCNWLFCGKRFTR 
SDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGEPGPGPPPSGPKELGEGR 
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SVGEEEANQPPRSSTSPAPPEKAHGGSPEQSNLLEI 



Alignment Scores: 

Pred. No. : 4 .78e-131 



Length: 
Matches : 



2960 
428 



Score: 2384.00 

Percent Similarity: 100.0% 

Best Local Similarity: 100.0% 

Query Match: 100.0% 

DB: 2 



Conservative : 
Mismatches : 



Indels : 
Gaps : 



0 
0 
0 
0 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



734-329B-2 (1-428) x AX555029 (1-2960) 

1 MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

100 ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 159 

21 ThrAlaAlaCysSerLysPheGlyGlySerSerProLeiiArgAspSerThrThrLeuGly 40 
I I I II I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

160 AC T GC AGCC T GC AGC AAAT T TGGC GGC T C TAGC CC T C TGC GGGAC TC AACAAC C C TGGGG 219 

41 LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 60 
f I I I 1 I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I 1 I I II I I I I I I I 

220 AAAGGAGGC AC AAAGAAGC CAT AC GC T GAC C T T TC AGCC C CCAAAACC ATGGGGGAC GC C 279 

61 TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 80 
I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I M I M I I 
280 TACCCAGCTCCCTTCTCAAGCACCAATGGACTCCTCTCTCCTGCAGGCAGTCCTCCGGCC 339 

81 ProAlaSerGlyTyrAlaAsnAspTyrProProPheProHisSerPheProGlyProThr 100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
340 CCAGCCTCTGGCTATGCAAATGACTACCCACCCTTCCCTCACTCATTTCCTGGGCCCACC 399 

101 GlyAlaGlnAspProGlyLeuLeuValProLysGlyHisSerSerSerAspCysLeuPro 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
400 GGT GC CC AAGAC CC T GGGC TC C TAGTGC C TAAGGGGC AC AGC TCGTC T GAC T GC C TGCC T 459 

121 SerValTyrThrSerLeuAspMetThrHisProTyrGlySerTrpTyrLysAlaGlylle 140 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
4 60 AGT GTCT AC AC T TCC C TGGAT AT GAC TC ATC C C TAT GGC T C GTGGT AC AAGGCAGGCAT C 519 

141 HisAlaGlylleSerProGlyProGlyAsnThrProThrProTrpTrpAspMetHisPro 160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
520 CAC GCAGGCATCTCACCAGGTCCAGGCAACACACC T AC TC CT TGGTGGGACATGCACCC T 579 

161 GlyGlyAsnTrpLeuGlyGlyGlyGlnGlyGlnGlyAspGlyLeuGlnGlyThrLeuSer 180 
I I I I I I I I II I I I I I I I I I I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

580 GGGGGCAACTGGCTAGGTGGTGGTCAGGGCCAGGGTGATGGGCTGCAAGGGACACTGTCC 639 

181 ThrGlyProAlaGlnProProLeuAsnProGlnLeuProThrTyrProSerAspPheAla 200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
640 AC AGGCC C TGC C C AGCC T C CAC T GAAC C C C C AGC TGCC TAC T TAC C C ATC TGAC T T TGC T 699 

201 ProLeuAsnProAlaProTyrProAlaProHisLeuLeuGlnProGlyProGlnHisVal 220 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
700 CCCCTTAACCCAGCTCCCTACCCAGCGCCCCACCTCTTGCAACCAGGGCCCCAGCATGTC 759 

221 LeuProGlnAspValTyrLysProLysAlaValGlyAsnSerGlyGlnLeuGluGlySer 240 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

760 C TACC CC AAGATGTC TATAAGC C CAAGGC GGT TGGC AATAGTGGGCAAC TGGAGGGGAGT 819 
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Qy 



Db 



241 GlyAlaAlaLysProProArgGlyAlaGlyThrGlyGlySerGlyGlyTyrAlaGlySer 260 

I I I I I I I I I I I I I I I I I I I 11 I I I I I I I II I I I I I I I I I II I ! I I I I I I I I I 1 I I I I I I I 
820 GGTGCAGCCAAACCCCCTCGGGGTGCTGGCACAGGGGGCAGCGGTGGATATGCGGGCAGT 879 



Qy 



Db 



261 GlyAlaGlyArgSerThrCysAspCysProAsnCysGlnGluLeuGluArgLeuGlyAla 280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
880 GGGGC AGGGC GT TCT AC C T GC GAC TGC C C CAAC TGT C AGG AGC T AGAGCGGC TC GGGGC A 939 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



281 



940 



301 



AlaAlaAlaGlyLeuArgLysLysProIleHisSerCysHisIleProGlyCysGlyLys 
I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I 
GCAGCGGCTGGGCTGAGGAAGAAGCCCATTCACAGCTGCCACATCCCTGGGTGCGGCAAG 



300 



999 



320 



ValTyrGlyLysAlaSerHisLeuLysAlaHisLeuArgTrpHisThrGlyGluArgPro 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1000 GTGTACGGCAAGGCTTCGCATCTGAAAGCCCACTTGCGCTGGCACACTGGCGAGAGGCCT 1059 



321 



340 



PheValCysAsnTrpLeuPheCysGlyLysArgPheThrArgSerAspGluLeuGluArg 
I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I 
1060 TTCGTCTGCAACTGGCTTTTCTGCGGCAAGAGGTTCACTCGCTCTGACGAGCTGGAGCGC 1119 

341 HisValArgThrHisThrArgGluLysLysPheThrCysLeuLeuCysSerLysArgPhe 360 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
1120 C AC GT GC GC AC TC AC AC C C GGGAGAAGAAGT T C AC T T GCC TGC TC TGT TC C AAGCGC T T T 117 9 



361 



380 



ThrArgSerAspHisLeuSerLysHisGlnArgThrHisGlyGluProGlyProGlyPro 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 
1180 ACCAGAAGCGACCACTTGAGCAAACATCAGCGCACCCACGGGGAGCCAGGCCCGGGACCG 1239 



Qy 381 ProProSerGlyProLysGluLeuGlyGluGlyArgSerValGlyGluGluGluAlaAsn 400 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I 
Db 1240 CC C C CAAGTGGCC CT AAGGAGC TGGGGGAGGGTC GC AGCG TC GGGGAAGAAGAAGC CAAT 1299 

Qy 401 GlnProProArgSerSerThrSerProAlaProProGluLysAlaHisGlyGlySerPro 420 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1300 C AGCC GC C CC GAT CT T C CAC T TCGCC TGC AC C CC C AGAAAAAGC C C AC GGAGGC AGC CC A 1359 



Qy 421 GluGlnSerAsnLeuLeuGluIle 428 

I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1360 GAGC AGAGCAAC C TGC T AGAGATC 138 3 



RESULT 3 
AF184 902 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 



AF184902 2960 bp mRNA linear ROD 12-JAN-2002 

Mus musculus Osterix (C22) mRNA, complete cds . 

AF184902 

AF184902.1 GI: 18138080 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Mus. 

1 (bases 1 to 2960) 

Nakashima, K. , Zhou,X., Kunkel,G., Zhang, Z., Deng, J. M. , 
Behringer , R. R. and de Crombrugghe, B. 

The novel zinc finger-containing transcription factor osterix is 
required for osteoblast differentiation and bone formation 
Cell 108 (1) , 17-29 (2002) 
11792318 

2 (bases 1 to 2960) 

Nakashima, K. , Zhou,X. and de Crombrugghe, B . 
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TITLE 
JOURNAL 

FEATURES 

source 



gene 



CDS 



Direct Submission 

Submitted ( 10-SEP-1999) Molecular Genetics, M.D. Anderson Cancer 
Center, 1515 Holcombe, Houston, TX 77030, USA 

Location/Qualifiers 

1. .2960 

/organism="Mus musculus" 

/mol_type="mRNA" 

/db_xref="taxon: 10090" 

/cell_line="C2C12" 

/cell_type="myoblas ts " 

1. .2960 

/gene="C22" 

100. .1386 

/gene="C22" 

/note="zinc finger protein; osteoblast/chondrocyte 

specific transcription factor" 

/codon_start=l 

/product="Osterix" 

/protein_id="AAL60067 .1" 

/db_xref ="GI : 18138081 " 

/trans la tion= "MAS SLLEEEAHYGSSPLAMLTAACSKFGGSSPLRDSTTLGKGGT 
KKPYADLSAPKTMGDAYPAPFSSTNGLLSPAGSPPAPASGYANDYPPFPHSFPGPTGA 
QDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWYKAGIHAGISPGPGNTPTPWWDMHP 
GGNWLGGGQGQGDGLQGTLSTGPAQPPLNPQLPTYPSDFAPLNPAPYPAPHLLQPGPQ 
HVLPQDVYKPKAVGNSGQLEGSGAAKPPRGAGTGGSGGYAGSGAGRSTCDCPNCQELE 
RLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHTGERPFVCNWLFCGKRFTR 
SDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGEPGPGPPPSGPKELGEGR 
SVGEEEANQPPRSSTS PAPPEKAHGGS PEQSNLLEI " 



ORIGIN 



Alignment Scores: 

Pred. No. : 4.78e-131 

Score: 2384.00 

Percent Similarity: 100.0% 
Best Local Similarity: 100.0% 

Query Match: 100.0% 

DB: 6 



Length: 2960 

Matches: 428 

Conservative: 0 

Mismatches : 0 

Indels : 0 

Gaps: 0 



US-09-734-329B-2 (1-428) x AF184902 (1-2960) 



Qy 



Db 



100 



MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 159 



Qy 



Db 



21 ThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThrThrLeuGly 40 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
160 ACTGCAGCCTGCAGCAAATTTGGCGGCTCTAGCCCTCTGCGGGACTCAACAACCCTGGGG 219 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



41 LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 60 
I I I 11 I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
220 AAAGGAGGCAC AAAGAAGCCATAC GC TGACCT TTCAGCCC CCAAAACCATGGGGGACGCC 279 

61 TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I II I I I I I I I I I I I I 
280 TACCCAGCTCCCTTCTCAAGCACCAATGGACTCCTCTCTCCTGCAGGCAGTCCTCCGGCC 339 

81 ProAlaSerGlyTyrAlaAsnAspTyrProProPheProHisSerPheProGlyProThr 100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I 
340 CCAGCCTCTGGCTATGCAAATGACTACCCACCCTTCCCTCACTCATTTCCTGGGCCCACC 399 

101 GlyAlaGlnAspProGlyLeuLeuValProLysGlyHisSerSerSerAspCysLeuPro 120 
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1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I ! I i i i i i i i i ii i i i i i i i i i i i i i i i i i i i t i 




Db 


400 


1 yjrK ^ ^ ^rtrtvj-HA^ ^ i vjVtVjo ilci >vj i CtC C I AAGGGGC AC AGC TC GTC TGAC T GC C T GC C T 


459 


Qy 


121 


SerValTyrThrSerLeuAspMetThrHisProTyrGlySerTrpTyrLysAlaGlylle 


140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 ! f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I r i 

1 1 1 1 ' ' 1 M 1 1 • 1 1 '» > 1 1 M 1 II 1 1 1 11 I I 1 I 1 || | 1 || (I 1 | | || || || II II | | | | | 




Db 


460 


± \j ±\s x tv^sv^, ii^^-^i j\i LtAU 1 OA I C C C TAT GGC T C GTGGT AC AAGGC AGGC ATC 


519 


Qy 


141 


HisAlaGlylleSerProGlyProGlyAsnThrProThrProTrpTrpAspMetHisPro 


160 


Db 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 M 1 1 II 1 1 1 t 1 I I t 1 I I I I I I I I I I i i t i i i i 
■ ■ 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I M 1 1 1 1 1 1 1 I I I I I | | | | | | | | | | | | | | | 1 | | | f | | | | | | | 




520 


w^vjw\u^jw\± \* i u/v^/\LtL7 i u UmjLrCAAO AC AC CT AC TC C T TGGTGGGAC ATGC ACCC T 


579 


Qy 


161 


GlyGlyAsnTrpLeuGlyGlyGlyGlnGlyGlnGlyAspGlyLeuGlnGlyThrLeuSer 


180 


Db 




1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 11 l ll if M i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i 
1 1 ' 1 1 ' 1 1 1 1 1 1 I 1 1 1 1 1 1 1 [ 1 | t 1 M 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 1 1 1 I I M I | M M | | | 




S80 


\jte\j*jty^j\iv>* ± IjLtL, I AWjl bG I GGTCAGGGCCAGGGTGATGGGCTGCAAGGGACACTGTCC 


639 


Qy 


181 


ThrGlyProAlaGlnProProLeuAsnProGlnLeuProThrTyrProSerAspPheAla 


200 






1 1 1 1 i 1 1 1 1 1 1 1 l l l l l I I t I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 M 1 1 1 1 ' l l l l l l l 1 1 1 | | | | | | | | || 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I I I I | | 




Db 


OIL* 


uli LtU u l AGU C 1 C CAC TGAAC C CCC AGC T GCC TAC T TAC CC AT C TGAC T T TGC T 


699 


Qy 


201 


ProLeuAsnProAlaProTyrProAlaProHisLeuLeuGlnProGlyProGlnHisVal 


220 


Db 




1 1 1 H 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 I } I I I I t f i i i i i i i i i i i i i i i i i i i 

1 1 1 ' 1 1 ' 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 | M | M | | | M | | | | | | | | | | | | | | | J | | | ( | | | | | | 




700 

/WW 


i~<~ol 1 1 AAULLAGC i CCC TACCCAGCGCCCCACCTCTTGCAACCAGGGCCCCAGCATGTC 


759 


Qy 


221 


LeuProGlnAspValTyrLysProLysAlaValGlyAsnSerGlyGlnLeuGluGlySer 


240 


Db 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l I I I I I I i i i i i i i i i i i i i i i i i i t i i i i i t i i i t i i i i i 
1 1 1 1 ' ' 1 1 1 i i ii I I I I I I I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 I I I I | | | | | | | | J | | | | | | | | | 




760 


o l AUUCCAAGA 1 GT C TAT AAGC CC AAGGC GGT TGGCAAT AGTGGGCAAC T GGAGGGGAGT 


819 


Qy 


241 


GlyAlaAlaLysProProArgGlyAlaGlyThrGlyGlySerGlyGlyTyrAlaGlySer 


260 


Db 




l i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i • i i ,i 

1 1 1 1 1 1 1 1 • 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | j 




R?0 


Vj<j i w-aijOUAAACCCCC 1 C GGGGT GC TGGC AC AGGGGGC AGCGGTGGAT ATGCGGGC AGT 


879 


Qy 


261 


GlyAlaGlyArgSerThrCysAspCysProAsnCysGlnGluLeuGluArgLeuGlyAla 


280 


Db 




t l I I I I I I I I I i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i 

' i i i i i i i i i i i i i i i | | | | | | | | f | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 




880 


LtltVjltL/«jLt(jCL7 1 1 c 1 ACC I GCGACTGCCCCAACTGTCAGGAGCTAGAGCGGCTCGGGGCA 


939 


Qy 


281 


AlaAlaAlaGlyLeuArgLysLysProIleHisSerCysHisIleProGlyCysGlyLys 


300 


Db 




1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l i i i i i i i i i i i i i t i i tii 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 | | | | | | M j | | 




940 


ltl/\ltC WjL i LtVjvjO i GAGGAAGAAGC C CAT TC AC AGC TGC CAC AT C C C T GGGT GC GGCAAG 


999 


Qy 


301 


ValTyrGlyLysAlaSerHisLeuLysAlaHisLenArgTrpHisThrGlyGluArgPro 


320 


Db 




1 1 II I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I I II I it I i i t t i t i i i i i i t i i i i i i i t i i i i i i i i i i 
1 1 1 • 1 1 1 1 1 'I' I' 1 1 ' 1 M II 1 1 1 1 M If 1 1 1 If II 1 1 1 1 1 1 1 | | | | | | M | | | | | | | | 




1 000 


i Lr i LtCtCAACtGC T TC GC ATC TGAAAGCCC AC T TGCGC TGGCACAC T GGC GAGAGGCC T 


1059 


Qy 


321 


PheValCysAsnTrpLeuPheCysGlyLysArgPheThrArgSerTVspGluLeuGluArg 


340 


Db 




1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 II 1 l l l l l i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i 
i i i i i i i i i i if i i i i i i i l l l l l l 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | | | 




1 060 


u ^ bILX (jUAAL 1 trvrL. i 1 1 1 C TGC GGCAAGAGGTTC AC TC GC TC TGAC GAGC TGGAGCGC 


1119 


Qy 


341 


HisValArgThrHisThrArgGluLysLysPheThrCysLeuLeuCysSerLysArgPhe 


360 


Db 




1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 t 1 l l l l i i i i i i i i i i t t i i i i t i i i i i t i i i i i i i i i > i i i i i 

1 1 1 M 1 1 1 1 1 1 1 1 1 I I I I | | | | | | | | | | f ( | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 




1120 


^iv^ vj i vjl «vhu i LHLKLtt irirtxAtjAAfcxAAGT T CAC T TGCC TGC TC TGT TC CAAGC GC TTT 


1179 


Qy 


361 


ThrArgSerAspHisLeuSerLysHisGlnArgThrHisGlyGluProGlyProGlyPro 


380 


Db 




1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 l l l l I I I I i i i i i i i i i i i i t i i i i i i i i i ■ 

i i i i i i i i f i i i i i i i l l I l 1 I 1 1 I 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | | | 




1180 


rt^LrttrMH^^HLLAL i loALr^AAACAlCAGCGCACCCACGGGGAGCCAGGCCCGGGACCG 


1239 


Qy 


381 


ProProSerGlyProLysGluLeuGlyGluGlyArgSerValGlyGluGluGluAlaAsn 


400 


Db 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l I i I I i I i i i i i i i i i i i i i i i I i i i i i i ii 

> > i i i i i i i i i i i i i i i i i i i i i I I I 1 I 1 M 1 1 1 1 1 1 1 I I I M 1 1 1 1 1 1 1 1 1 I I | | | | | | 




1240 


CCCCCAAGTGGCCCTAAGGAGCTGGGGGAGGGTrr;rnarcTrr^r^a^ 


1299 


Qy 


401 


GlnProProArgSerSerThrSerProAlaProProGluLysAlaHisGlyGlySerPro 


420 






lllllll 1 MINIMUM lllllill MMIII 1 MM MINIM 
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Db 

Qy 

Db 



1300 CAGCCGCCCCGATCTTCCACTTCGCCTGCACCCCCAGAAAAAGCCCACGGAGGCAGCCCA 1359 



421 



428 



GluGlnSerAsnLeuLeuGluIle 
I I I I I I I I I I I I I I I I I I I I I I I I 
1360 GAGCAGAGCAACCTGCTAGAGATC 1383 



RESULT 4 

AC055703/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



COMMENT 



AC055703 138860 bp DNA linear HTG 15-MAY-2002 

Mus musculus strain C57BL6/J clone RP23-399N14, WORKING DRAFT 
SEQUENCE, 37 unordered pieces. 
AC055703 

AC055703. 9 GI : 20279384 
HTG; HTGS_PHASE1; HTGS_DRAFT. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Mus. 

1 (bases 1 to 138860) 

Grills, G., Han, J., Montgomery, K. T . , Lee,E., Long, J., Pomerantz, R. , 
Ioshikhes, I. P., Shim, C, Decker, J., Thomas, E., Perera,A., 
Gordon,M., Goltz,J.S. and Kucherlapati, R. 
High Throughput Mouse Sequencing 
Unpublished 

2 (bases 1 to 138860) 

Grills, G., Han, J., Montgomery, K.T. 
Ioshikhes, I. P., Shim,C., Decker, J. 



Lee, E 
Thomas, E 



Long, J. , Pomerantz, R. 
Perera, A. , 



Gordon, M. , Goltz,J.S. and Kucherlapati, R. 
Direct Submission 

Submitted (18-APR-2000) Department of Molecular Genetics, Albert 
Einstein College of Medicine Genome Center, 1300 Morris Park Ave., 
Bronx, NY 104 61, USA 
3 (bases 1 to 138860) 

Grills, G., Han, J., Montgomery, K. T . , Lee,E., Long, J., Pomerantz, R. , 
Ioshikhes, I. P. , Shim,C, Decker, J., Thomas, E., Perera, A., 
Gordon, M. , Goltz,J.S. and Kucherlapati, R. 
Direct Submission 

Submitted (24-APR-2002 ) Harvard Partners Center for Genetics and 
Genomics, Harvard Medical School, 65 Landsdowne St, Cambridge, MA 
02139, USA 

On Apr 24, 2002 this sequence version replaced gi: 18151001. 

Genome Center 

Center: Harvard Partners Genome Center 
Center Code: HPGC 

Web site: http://www.hpcgg.org/Sequence/mouse.html 
Contact : hpgc@mendel . mgh . harvard . edu 

Summary Statistics 

Center project name: AAS 
Sequencing vector: pUC18; L08752 
Chemistry: Dye-terminator Big Dye; 100% 
*Consensus quality: 126666 at least Q20 
*Consensus quality: 122952 at least Q30 
Consensus quality: 116499 at least Q40 
Estimated insert size: agarose-FP - N/A 
**Estimated insert size: 138140 - sum-of-contigs 
Quality coverage: agarose-FP - N/A 

Quality coverage: 9.1 x in Q20 bases; sum-of-contigs estimation 

* NOTE: This is a 'working draft 1 sequence. It currently 

* consists of 37 contigs. The true order of the pieces 
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SCORE Search Results Details for Application 
09734329 and Search Result us-09-734-329b- 
2,p2n.rst, 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page Ljst Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 09734329 and Search Result us-09- 

734-329b-2.p2n.rst. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - nucleic search, using frame_plus_p2n model 

Run on: July 25, 2006, 09:19:18 ; Search time 6389 Seconds 

(without alignments) 
5619.066 Million cell updates/sec 

Title : US-09-734-329B-2 
Perfect score: 2384 

Sequence: 1 MASSLLEEEAHYGSSPLAML PAPPEKAHGGSPEQSNLLEI 428 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7 . 0 

Delop 6.0 , Delext 7.0 

Searched: 48236798 seqs, 27959665780 residues 

Total number of hits satisfying chosen parameters: 96473596 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US09734329/runat_25072006_091905_3703/app_query. fasta_l 
-DB=EST -QFMT=fastap -SUFFIX=p2n. rst -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE = LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss08 
-USER=US09734329_@CGN_l_l_6323_@runat_25072006_091905_3703 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120 
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAP0P=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : EST:* 

1: gb_estl:* 
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2 




gb_est3 : * 


3 




gb_est4 : * 


4 




gb_est5 : * 


5 




gb_est6: * 


6 




gb_htc : * 


7 




gb_est2:* 


8 




gb_est7:* 


9 




gb_est8:* 


10: 


gb_est9: * 


11: 


gb_gssl : * 


12: 


gb_gss2 : * 


13: 


gb_gss3 : * 


14: 


gb_gss4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length DB 


ID 


Description 




1 


2384 


1UU 


. u 


z yu / 


b 


AK032521 


AK032521 Mus muscu 




2 


2350 


QQ 


. o 


£. O / H 


D 


AK077375 


AK077375 Mus muscu 




3 


2326 


J / 


c 


z y± / 


b 


AK076229 


AK07 622 9 Mus muscu 




4 


1182.5 


A Q 

h y 






A 

ft 


BX369746 


BX369746 BX369746 




5 


1119.5 


A 1 


. u 




A 

H 


BX394280 


BX394280 BX394280 




6 


1115 


A & 

H. D 


Q 

. o 




c 
D 


CF905412 


CF905412 A0434B03- 




7 


1115 


4fi 


. o 


DjO 


e 
O 


CF905401 


CF905401 A0434A03- 




8 


1100 


4fi 


i 

• -L 


DUO 


c 
O 


CF906401 


CF906401 A0446H03- 




9 


1037.5 


4^ 


• j 


74 


Q 
O 


CN287220 


CN287220 170005325 




10 


1000 


41 

•a J. 


Q 
• -7 


DIJ 


7 


BB622399 


BB622399 BB622399 




11 


808.5 


33 


Q 




D 


AK030745 


AK030745 Mus muscu 




12 


807.5 


33 


.9 


3659 


6 


AK053093 


AK053093 Mus muscu 




13 


799 


33 


.5 


2590 


6 


AK135123 


AK135123 Mus muscu 


c 


14 


754.5 


31 


.6 


1798 


6 


BC023224 


BC023224 Mus muscu 




15 


747.5 


31 


.4 


643 


7 


BB622895 


BB622895 BB622895 




16 


704 


29 


.5 


670 


7 


BB618195 


BB618195 BB618195 




17 


690 


28 


.9 


1141 


10 


DW031048 


DW031048 CFW209-B0 


c 


18 


672.5 


28 


.2 


5272 


6 


AK029830 


AK029830 Mus muscu 




19 


667.5 


28 


.0 


1131 


14 


DQ047750 


DQ047750 Homo sapi 




20 


662.5 


27 


.8 


1131 


14 


DQ047751 


DQ047751 Pan trogl 




21 


662 


27 


.8 


3599 


6 


AK137686 


AK13768 6 Mus muscu 




22 


660 


27 


.7 


583 


8 


C0778267 


C0778267 BL003B_F0 




23 


658.5 


27 


.6 


763 


3 


BP679532 


BP679532 BP679532 




24 


649 


27 


.2 


871 


8 


CX311164 


CX311164 JGI_XZT11 




25 


612.5 


25 


.7 


819 


5 


CK143124 


CK143124 AGENCOURT 


c 


26 


608.5 


25 


.5 


1088 


10 


DW665598 


DW665598 CNB332-C0 




27 


606.5 


25 


.4 


989 


3 


BU528562 


BU528562 AGENCOURT 




28 


588.5 


24 


.7 


561 


3 


BQ419249 


BQ419249 faa36e03. 


c 


29 


587 


24 


.6 


638 


1 


AL662349 


AL662349 AL662349 




30 


586 


24 


.6 


801 


9 


CX836896 


CX836896 JGI_CAAK7 




31 


576 


24 


.2 


2078 


3 


BP525450 


BP525450 BP525450 


c 


32 


572 


24 


.0 


641 


10 


DT247674 


DT247 674 JGI_CAAU2 




33 


564.5 


23 


.7 


778 


12 


CC862573 


CC862573 NDL.22K1. 




34 


562 


23 


.6 


753 


10 


DW556033 


DW556033 EST_ssal_ 




35 


557.5 


23 


.4 


443 


1 


AL960681 


AL960681 AL960681 




36 


555.5 


23 


.3 


474 


2 


BI979366 


BI979366 ft86e04.y 




37 


553.5 


23 


.2 


2097 


6 


AK036234 


AK036234 Mus muscu 




38 


552 


23 


.2 


471 


12 


CE281682 


CE281682 tigr-gss- 
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c 39 


517 


21. 


7 


686 


10 


DW253992 


DW253992 UI-S-GBl- 


40 


510 


21. 


4 


664 


2 


BJ626154 


BJ626154 BJ626154 


41 


509 


21. 


4 


1084 


10 


DW665599 


DW665599 CNB332-C0 


42 


506 


21. 


2 


899 


5 


CD757648 


CD757648 AGENCOURT 


c 43 


504 


21. 


1 


694 


10 


DW271862 


DW271862 UI-S-GSl- 


44 


504 


21. 


1 


866 


5 


CD755971 


CD7 55971 AGENCOURT 


45 


503 


21. 


1 


836 


5 


CD756314 


CD756314 AGENCOURT 
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TITLE 
JOURNAL 
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JOURNAL 
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REFERENCE 
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TITLE 

JOURNAL 
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JOURNAL 
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JOURNAL 
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AK032521 2907 bp mRNA linear HTC 02-SEP-2005 

Mus musculus adult male olfactory brain cDNA, RIKEN full-length 
enriched library, clone : 6430578P22 product : osterix, full insert 
sequence. 
AK032521 

AK032521.1 GI: 26328334 
HTC; CAP trapper. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
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Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 
Normalization and subtraction of cap-trapper-selected cDNAs to 
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Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki, Y. 
RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
11076861 
4 

The RIKEN Genome Exploration Research Group Phase II Team and the 
FANTOM Consortium. 

Functional annotation of a full-length mouse cDNA collection 

Nature 409, 685-690 (2001) 

5 

The FANTOM Consortium, the RIKEN Genome Exploration Research Group 
Phase I and II Team. 

Analysis of the mouse transcriptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420, 563-573 (2002) 
6 
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AUTHORS RIKEN Genome Exploration Research Group, Genome Science Group 

(Genome Network Core Team) and the FANTOM Consortium. 
TITLE Antisense ^Transcription in the Mammalian Transcriptome 
JOURNAL Science 309, 1564-1566 (2005) 
REFERENCE 7 

AUTHORS The FANTOM Consortium, Riken Genome Exploration Research Group and 
Genome Science Group (Genome Network Project Core Group) . 

TITLE The Transcriptional Landscape of the Mammalian Genome 

JOURNAL Science 309, 1559-1563 (2005) 
REFERENCE 8 (bases 1 to 2907) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume, W. , 
Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C, Matsuyama, T. , Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura, K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh, H., Sakai,C, Sakai,K., Sakazume,N., 
Sano,H., Sasaki, D., Shibata,K., Shinagawa, A. , Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi, F. , Takaku-Akahira, S . , 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi, A. , 
Muramatsu,M. and Hayashizaki, Y . 

TITLE Direct Submission 

JOURNAL Submitted ( 16- JUL-2001 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN) , Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC) , 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa, 230-0045, Japan (E-mail: genome-res@gsc.riken.jp, 
URL :http: //genome. gsc .riken. jp/, Tel: 81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL :http: //genome . gsc . riken. jp/ 
URL : http : // f antom . gsc . riken . j p/ . 
FEATURES Location/Qualifiers 
source 1. .2907 

/organism="Mus musculus" 

/mol_t ype= "mRNA" 

/strain="C57BL/6J" 

/db_xr e f = " FANTOM_DB : 6 4 30 5 7 8 P2 2 " 

/db_xref="taxon: 10090" 

/clone="6430578P22" 

/sex="male" 

/tissue_type="olfactory brain" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_s tage= " adult " 
CDS 93. .1379 

/note= n unnamed protein product; 
GB|NM_130458, evidence: BLASTN, 
putative" 
/codon_start=l 
/protein_id="BAC27908 .1" 
/db_xref="GI:26328335" 

/trans la tion= "MAS SLLEEEAHYGSSPLAMLTAACSKFGGSSPLRDSTTLGKGGT 
KKPYADLSAPKTMGDAYPAPFSSTNGLLSPAGSPPAPASGYANDYPPFPHSFPGPTGA 
QDPGLLVPKGHSSSDCLPSVYTSLDMTHPYGSWYKAGIHAGISPGPGNTPTPWWDMHP 
GGNWLGGGQGQGDGLQGTLSTGPAQPPLNPQLPTYPSDFAPLNPAPYPAPHLLQPGPQ 



osterix (LocusLink | 170574 
99%, match=1759) 
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polyA_signal 
polyA_site 
ORIGIN 



HVLPQDVYKPKAVGNSGQLEGSGAAKPPRGAGTGGSGGYAGSGAGRSTCDCPNCQELE 

RLGAAAAGLRKKPIHSCHIPGCGKVYGKASHLKAHLRWHTGERPFVCNWLFCGKRFTR 

SDELERHVRTHTREKKFTCLLCSKRFTRSDHLSKHQRTHGE PGPGPPPSGPKELGEGR 

SVGEEEANQPPRSSTSPAPPEKAHGGSPEQSNLLEI" 

2891. .2896 

/note="putative" 

2907 

/note="putative " 



Alignment Scores : 

Pred. No. : 3e-143 

Score: 2384.00 

Percent Similarity: 100.0% 
Best Local Similarity: 100.0% 

Query Match: 100.0% 

DB: 6 



Length: 2907 

Matches: 428 

Conservative : 0 

Mismatches: 0 

Indels: 0 

Gaps: 0 



US-09-734-329B-2 (1-428) x AK032521 (1-2907) 



Qy 


i 


Db 


93 


Qy 


21 


Db 


153 


Qy 


41 


Db 


213 


Qy 


61 


Db 


273 


Qy 


81 


Db 


333 


Qy 


101 


Db 


393 


Qy 


121 


Db 


453 


Qy 


141 


Db 


513 


Qy 


161 


Db 


573 


Qy 


181 


Db 


633 


Qy 


201 



MetAlaSerSerLeuLeuGluGluGluAlaHisTyrGlySerSerProLeuAlaMetLeu 20 
I I I I I I I I I I I I I I I I I t I I I I I I ! I II I I I I I I I I I I I I ! I 1 I I I I I I I I I I I I I I I I I 
ATGGCGTCCTCTCTGCTTGAGGAAGAAGCTCACTATGGCTCCAGTCCCCTGGCCATGCTG 152 

ThrAlaAlaCysSerLysPheGlyGlySerSerProLeuArgAspSerThrThrLeuGly 40 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AC T GCAGC CTGC AGC AAAT T TGGC GGC T C TAGCC C T C TGC GGGAC TC AAC AAC C C TGGGG 



LysGlyGlyThrLysLysProTyrAlaAspLeuSerAlaProLysThrMetGlyAspAla 
I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAAGGAGGCAC AAAGAAGC CAT AC GC TGACC T T TC AGC C C CCAAAACCATGG GGGAC GC C 

TyrProAlaProPheSerSerThrAsnGlyLeuLeuSerProAlaGlySerProProAla 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TACCCAGCTCCCTTCTCAAGCACCAATGGACTCCTCTCTCCTGCAGGCAGTCCTCCGGCC 

ProAlaSerGlyTyrAlaAsnAspTyrProProPheProHisSerPheProGlyProThr 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I 
CCAGCCTCTGGCTATGCAAATGACTACCCACCCTTCCCTCACTCATTTCCTGGGCCCACC 

GlyAlaGlnAspProGlyLeuLeuValProLysGlyHisSerSerSerAspCysLeuPro 
I I I I I I I I I M I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGTGCCCAAGACCCTGGGCTCCTAGTGCCTAAGGGGCACAGCTCGTCTGACTGCCTGCCT 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGTGTCTACACTTCCCTGGATATGACTCATCCCTATGGCTCGTGGTACAAGGCAGGCATC 

HisAlaGlylleSerProGlyProGlyAsnThrProThrProTrpTrpAspMetHisPro 
I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
C AC GC AGGCATC TC ACCAGGT CC AGGCAAC AC AC C T AC TC C T TGGTGGGAC ATGC AC C C T 



i I I M M Mil I I I t MM I I I I I II I I I II I I I III I I M I I II I I M I I II I I 

GGGGGCAAC TGGC TAGGTGGTGGT C AGGGC CAGGGT GATGGGC T GC AAGGGAC AC T GT C C 

ThrGlyProAlaGlnProProLeuAsnProGlnLeuProThrTyrProSerAspPheAla 
M I I I I M I I I M I I I I II I I I I I I 1 I I | | | | | | | | | | | | | | | | | | | I | | | | | | | | | | | 1 
ACAGGCCCTGCCCAGCCTCCACTGAACCCCCAGCTGCCTACTTACCCATCTGACTTTGCT 



212 



60 



272 



80 



332 



100 



392 



120 



452 



140 



512 



160 



572 



180 



632 



200 



692 



201 ProLeuAsnProAlaProTyrProAlaProHisLeuLeuGlnProGlyProGlnHisVal 220 
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a age \j ui *t j 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


693 


CCCCTTAACCCAGCTCCCTACCCAGCGCCCCACCTCTTGCAACCAGGGCCCCAGCATGTC 


752 


Qy 


221 


LeuProGlnAspValTyrLysProLysAlaValGlyAsnSerGlyGlnLeuGluGlySer 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


753 


CTACCCCAAGATGTC TATAAGCCC AAGGCGGT TGGC AATAGTGGGCAAC TGGAGGGGAGT 


812 


Qy 


241 


GlyAlaAlaLysProProArgGlyAlaGlyThrGlyGlySerGlyGlyTyrAlaGlySer 


260 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 




Db 


813 


GGTGCAGCCAAACCCCCTCGGGGTGCTGGCACAGGGGGCAGCGGTGGATATGCGGGCAGT 


872 


Qy 


261 


GlyAlaGlyArgSerThrCysAspCysProAsnCysGlnGluLeuGluArgLeuGlyAla 


280 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


873 


GGGGC AGGGC G T TC T ACC T GCGAC TGCC C C AAC TGT CAGGAGC T AGAGCGGC T CGGGGC A 


932 


Qy 


281 


AlaAlaAlaGlyLeuArgLysLysProIleHisSerCysHisIleProGlyCysGlyLys 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | M 1 1 1 1 1 1 




Db 


933 


GC AGC GGC TGGGC TGAGGAAGAAGCCC AT TC AC AGC T GCC AC ATCCC T GGGT GCGGCAAG 


992 


Qy 


301 


ValTyrGlyLysAlaSerHisLeuLysAlaHisLeuArgTrpHisThrGlyGluArgPro 


320 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 1 1 




Db 


993 


GTGT AC GGCAAGGC T T C GC ATC TGAAAGC CC ACT T GC GC T GGC AC AC T GGC GAGAGGC C T 


1052 


Qy 


321 


PheValCysAsnTrpLeuPheCysGlyLysArgPheThrArgSerAspGluLeuGluArg 


340 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 




Db 


1053 


T TC GTC T GCAACTGGC T T T TC TGC GGCAAGAGGT TC AC TC GC TC TGAC GAGC TGGAGCGC 


1112 


Qy 


341 


HisValArgThrHisThrArgGluLysLysPheThrCysLeuLeuCysSerLysArgPhe 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1113 


C AC GTGC GC AC TCAC ACCCGGGAGAAGAAGT T C AC C TGCC TGC T C TGT TCCAAGCGC T T T 


1172 


Qy 


361 


ThrArgSerAspHisLeuSerLysHisGlnArgThrHisGlyGluProGlyProGlyPro 


380 






1 1 1 1 1 1 f 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


1173 


AC CAGAAGC GACCAC T TGAGCAAAC ATC AGCGC ACC C AC GGGGAGC CAGGC CCGGGACCG 


1232 


Qy 


381 


ProProSerGlyProLysGluLeuGlyGluGlyArgSerValGlyGluGluGluAlaAsn 


400 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1233 


C CC CCAAGTGGC C C T AAGGAGC TGGGGGAGGG T C GC AGC GTCGGGGAAGAAGAAGC C AAT 




Qy 


401 


GlnProProArgSerSerThrSerProAlaProProGluLysAlaHisGlyGlySerPro 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1293 


CAGCCGCCCCGATCTTCCACTTCGCCTGCACCCCCAGAAAAAGrrrarr^Aaf^rnrzrrr'Zi 




Qy 


421 


GluGlnSerAsnLeuLeuGluIle 428 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1353 


GAGCAGAGCAACC TGC TAGAGATC 137 6 




RESULT 2 








AK077375 








LOCUS 


AK077375 2874 bp mRNA linear HTC 02-SEP- 


2005 


DEFINITION Mus musculus 6 days neonate head cDNA, RIKEN full-length enriched 




library, clone : 5430403B13 product : osterix, full insert sequence 




ACCESSION AK077375 




VERSION 


AK077375.1 GI: 26346245 




KEYWORDS 


HTC; CAP trapper. 




SOURCE 


Mus musculus (house mouse) 





ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; 
Sciurognathi; Muroidea; Muridae; Murinae; Mus. 
REFERENCE 1 
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